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Expression of Z brafish Bone Morphogenetic Protein 4 
BACKGROUND OF THE INVENTION 

[0001] The bone morphogenetic protein (BMP) family belongs to. the 
transforming growth factor-p (TGF-p) superfamily and includes a group of closely 
related polypeptides identified initially by their capacity to stimulate ectopic bone 
formation in vivo. BMPs are synthesized as large precursor proteins before being 
processed and proteolytically cleaved to form mature carboxyl-terminal dimers. 
BMP members, such as BMP1 to BMP14, have been known to have different 
levels of bone morphogenetic activity. For example, BMP2 and BMP4, which are 
expressed by osteoblasts as they differentiate, , have been shown to stimulate 
osteoblast differentiation and bone nodule formation in vitro. In addition, 
recombinant BMP2 and BMP4 can induce new bone formation when injected 
locally into the subcutaneous tissues of rats. 

[0002] BMPs transduce signals through binding cooperatively to both Type I 
and Type II receptors, which are trans-membrane serine-threonine kinase 
receptors. Transphosphorylation of the Type I receptor by the Type II kinase in 
the cytoplasmic domain triggers a downstream signaling cascade. However, little 
is known about signal transduction involved in BMP signaling pathways. 
Effectors (e.g., Mad proteins), which responded downstream to BMP signals, have 
recently been found in human and Xenopus tissues. 

[0003] Bone morphogenetic protein 4 (BMP4) is a member of the BMP family 
and, like other BMPs, is a multifunctional regulator during vertebrate development. 
BMP4 has been shown to play important roles in the establishment of the basic 
embryonic body plan (e.g., mesoderm formation, left-right asymmetry, dorsal- 
ventral patterning in vertebrates), in morphogenesis (e.g., skeletal development 
and limb patterning), and in the development of organs and tissues (e.g., the 
development of kidney, lung, heart, teeth, gut, and skin, and formation of the 
central and peripheral nervous system, etc.). In fact, the expression of the bone 
morphogenetic proteins and their receptors has been identified in a large variety 
of cells, tissues, and organs, and in specific temporal and spatial patterns. 
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[0004] Mechanisms regulating the expression of bmp genes in vivo are still 
largely unknown despite the identification of two mouse BMP4 transcripts and 
cloning of a mouse BMP4 gene. In addition, two human BMP4 transcripts have 
been identified and two human BMP4 promoter regions have been cloned. The 
two mouse BMP4 transcripts result from two alternative 5'non-coding exons, 1A 
and 1B in the BMP4 promoter region. It was found that 1A promoter is primarily 
utilized in bone cell cultures, and a chicken ovalbumin upstream-Transcription 
Factor I (coup-TFI) was demonstrated in vitro to negatively regulate murine BMP4 
1A promoter in fetal rat calvariae cells. Further, various transcripts resulting from 
several promoters have been observed for a BMP4 homologue in Drosophila 
melanogaster, decapentaplegic protein (dpp). The use of diverse and separate 
promoter regions for one BMP4 gene in different cells derived from different 
tissues suggests a cell-specific or tissue-specific regulation of BMP4 gene 
expression. Given the unstable half-life of most BMP4 transcripts, expression of 
bmp genes is largely regulated at the transcriptional level. 

[0005] Although considerable efforts have been focused on the study of BMP4 
function during zebrafish development, the molecular mechanisms regarding the 
expression of zebrafish BMP4 remain unclear. In contrast to human and murine 
BMP4, a single transcript has so far been identified for the zebrafish BMP4 gene. 
The finding and the materials and methods disclosed in the present invention 
suggest promoter structure, intron/exon organization, and cell-specific and/or 
tissue-specific regulation of zebrafish BMP4 gene expression are different from 
human and murine BMP4 despite high level of amino acid sequence homology 
among BMP4 proteins from humans, mice, and zebrafish. 

[0006] Therefore, there is a need to understand the regulation of zebrafish 
BMP4 expression, to provide further insights into molecular mechanisms, 
regulatory DNA sequences, and transcription factors that regulate development of 
various BMP4-expressing tissues and organs, and to identify molecular 
compounds/substances that induce or inhibit the expression of zebrafish BMP4 
expression. 

[0007] Recently, transgenic technology using various reporter genes, e.g., 
green fluorescent protein (GFP), has provided a powerful means to study gene 
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function and the regulation of gene expression. Thus, there is a need to provide 
cell lines and transgenic fish to allow real-time imaging of various morphogenetic 
processes in different cells, organs, tissues, and during embryogenesis. 

SUMMARY OF THE INVENTION 

[0008] The invention generally provides compositions, transgenic fishes, 
methods, and cell lines involved in the expression of bone morphogenetic protein 
4 (BMP4). In one embodiment, the invention provides isolated DNA molecules for 
a zebrafish bone morphogenetic protein 4 gene (BMP4), its promoter regions, the 
proximal and distal regulatory regions, and enhancer sequences. The invention 
provides isolated DNA molecules including nucleic acid sequences of SEQ. ID 
NO. 1, SEQ. ID NO. 2, SEQ. ID NO. 4, and derivatives and fragments thereof. 
The amino acid sequences deduced from the sequences of the isolated DNA 
molecules are included in SEQ. ID NO. 3, SEQ. ID NO. 5, SEQ. ID NO. 6, and 
derivatives and fragments thereof. 

[0009] In another embodiment, the invention provides isolated DNA molecules 
for tissue-specific expression. The isolated DNA molecules are utilized to drive 
tissue-specific expression of a gene and provide advantageous tools for 
heterologous gene expression and include nucleic acid sequences of SEQ. ID 
NO. 1, SEQ. ID NO. 2, SEQ. ID NO. 4, and derivatives and fragments thereof. 

[0010] In another embodiment, the invention provides recombinant expression 
vectors containing the DNA sequences of zebrafish bone morphogenetic protein 4 
(BMP4) gene, its promoter regions, the proximal and distal regulatory regions, and 
enhancer sequences, including nucleic acid sequences of SEQ. ID NO. 1, SEQ. 
ID NO. 2, SEQ. ID NO. 4, and derivatives and fragments thereof. The 
recombinant expression vectors can further include DNA sequences for 
heterologous expression products, such as reporter proteins. 

[0011] Various cell zebrafish embryos and adult fishes containing the DNA 
sequences for zebrafish bone morphogenetic protein 4 (BMP4) gene, promoter 
sequences of the zebrafish BMP4 gene, the proximal and distal regulatory regions, 
and the expression vectors of the present invention are provided for studying the 
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expression of BMP4 gene. The zebrafish embryos and adult fishes of the 
invention can further include DNA sequences for heterologous expression 
products. 

[0012] In another embodiment, the invention provides transgenic fish, such as 
transgenic zebrafish containing isolated DNA molecules integrated into zebrafish 
genomic chromosomes, wherein the isolated DNA molecules include zebrafish 
bone morphogenetic protein 4 (BMP4) gene, promoter sequences of the zebrafish 
BMP4 gene, the proximal and distal regulatory regions, and derivatives and 
fragments thereof. The transgenic fish can further include DNA sequences for 
heterologous expression products. 

[0013] In yet another embodiment, the invention provides a method for 
identifying a potential agent, compound, regulator, and/or transcription factor that 
regulates bone morphogenetic protein-4 expression. The method includes 
introducing into a cell a recombinant expression vector containing DNA 
sequences for zebrafish bone morphogenetic protein 4 (BMP4) gene, promoter 
sequences of the zebrafish BMP4 gene, the proximal and distal regulatory regions, 
and derivatives and fragments thereof. The method further includes contacting 
the cell with a candidate compound, and monitoring the expression level of the 
heterologous expression product to obtain an altered expression level in the 
presence of the candidate compound and identify the candidate compound as the 
potential agent. 

[0014] In yet another embodiment, a method for identifying a potential agent 
for zebrafish tissue-specific expression includes introducing into a cell a zebrafish 
tissue-specific expression sequence operatively linked to a nucleotide sequence 
for a heterologous expression product. The method further includes contacting 
the cell with a candidate compound and monitoring the expression level of the 
heterologous expression product in the presence of the candidate compound to 
identify the candidate compound as the potential agent regulating tissue-specific 
expression in zebrafish. 

[0015] In yet another embodiment, a method for screening an effecter (e.g., 
extracellular stimuli, inhibitors, compounds, or agents, as well as repressors, 
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activators and others) that regulates bone morphogenetic protein-4 expression is 
provided. An fish embryo or a transgenic fish having a portion of a zebrafish bone 
morphogenetic protein-4 promoter region and/or the proximal and distal regulatory 
regions, operatively linked to a heterologous expression product, is constructed. 
The method includes introducing a foreign DNA from a cDNA library into the 
embryo or transgenic fish of the invention and monitoring the expression level of 
the heterologous expression product for a number of the transgenic cell having 
foreign DNA from the cDNA library. An altered expression level in the presence of 
the foreign DNA as compared to the absence of the foreign DNA indicates the 
foreign DNA encodes the effecter. 

[0016] In yet another embodiment, a method for identifying an expression 
pattern of a zebrafish BMP4 expression sequence is provided. The method 
includes providing a zebrafish BMP4 expression sequence and/or the proximal 
and distal regulatory regions operatively linked to a nucleotide sequence for a 
heterologous expression product, introducing into a cell the zebrafish expression 
sequence, and monitoring the expression level of the heterologous expression 
product, thus identifying the expression pattern (e.g., developmental expression 
pattern, organ-specific, tissue-specific, or cell type-specific expression patterns) 
for the expression of BMP4 gene. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0017] So that the features of the invention can be understood in detail, a more 
particular description of the invention briefly summarized above may be had by 
reference to embodiments illustrated in the appended drawings. It is to be noted, 
however, that the appended drawings illustrate only certain embodiments of this 
invention should not be considered limiting of its scope, for the invention may 
admit to other equally effective embodiments. 

[0018] Figure 1 is a simplified schematic of the structural organization of the 
zebrafish BMP4 gene containing exons 1, exons 2, exons 3, exons 4, intron 1, 
intron 2, intron 3, coding regions, 5'-untranslated regions, 3'-untranslated regions, 
promoter regions, and proximal and distal regulatory sequences. 
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[0019] Figure 2 is a simplified schematic of the amino acid sequence deduced 
from the sequence of the cloned zebrafish BMP4 gene. 

[0020] Figure 3 is a schematic diagram of various 5'-deleted BMP4 promoter- 
GFP constructs. 

[0021] Figure 4 is a flow diagram illustrating an exemplary method of 
identifying a potential agent. 

[0022] Figure 5 is a flow diagram illustrating an exemplary method of screening 
a compound that regulates BMP4 gene expression. 

[0023] Figures 6A-6F demonstrate microscopic images for transient expression 
analysis of BMP4 promoter activity in zebrafish embryos injected with the Bgl II- 
GFP construct containing the 7.5-kb BMP4 promoter and regulatory DNA 
sequences. Figures 6A and 6B are the images of lateral view of prim-8 and long- 
pec embryos, respectively, examined under fluorescence microscope in DIC 
mode. Figures 6C and 6D are the images showing GFP expression in the heart of 
prim-8 and long-pec embryos, respectively, which are viewed under fluorescence 
microscope using FITC filter to localize heart-specific expression as indicated by 
fluorescence of GFP protein at the arrow. Figures 6E and 6F are the microscopic 
images after immunohistochemistry and cryostat sectioning showing GFP 
localization in the heart of prim-8 and long-pec embryos, respectively. Arrows 
indicate GFP localization in the heart. Scale bars represent 100 jxm. 

[0024] Figures 7A-7B demonstrate the percentage of GFP expression in a 
zebrafish embryos population for transient expression analysis of BMP4 promoter 
activity using various recombinant GFP constructs containing different lengths of 
the BMP4 promoter and upstream regions. The results of various recombinant 
GFP deletion constructs are shown in Figure 7A for heart-specific expression 
(solid circle) and in Figure 7B for expression in other regions (solid square), such 
as skin and muscle of long-pec embryos. 

[0025] Figure 8 demonstrates co-injection of different DNA fragments into 
zebrafish embryos to direct heart-specific expression in zebrafish. The 
percentages of GFP direct heart-specific expression are compared for long-pec 
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embryos injected with different DNA constructs. The DNA constructs contain 
either Exo III C-GFP, which has minimum promoter activity, alone, together with a 

2.4- kb Bgl W-Hind III DNA fragment, or with a 1.5-kb Age \-Bgl II DNA fragment. 
The results confirm a positive regulatory region within the 2.4-kb Bgl W-Hind III 
DNA fragment that controls heart-specific GFP expression. 

[0026] Figures 9A-9L demonstrate microscopic images of zebrafish transgenic 
F1 embryos for stable BMP4 promoter-GFP expression analysis. Fi embryos from 
a transgenic F 0 line (B cell line) containing the Bgl ll-GFP construct containing the 

7.5- kb BMP4 promoter are used herein. The results confirm that the 7.5-kb BMP4 
promoter direct stable heart-specific GFP expression and such stable and tissue- 
specific transgenic BMP4 expression is obtained from F 0 cell line to Fi progeny 
during zebrafish development. F1 embryos during different stages of zebrafish 
development are examined. Long-pec F1 embryos are shown in Figures 9A, 9D, 
9G, and 9J. Protruding-mouth F1 embryos are shown in Figures 9B, 9E, 9H, and 
9K. 100 hour post fertilization (100-hpf) F1 embryos are shown in Figures 9C, 9F, 
91, and 9L. Figures 9A, 9B, and 9C illustrate lateral view of Fi embryos examined 
under DIC or transmitted light mode. Figures 9D, 9E, and 9F illustrate lateral view 
of Fi embryos examined under fluorescence microscope using an FITC filter. 
Figures 9G, 9H, and 91 illustrate combined confocal images of F^ embryos from 
FITC and bright field modes. Figures 9J, 9K, and 9L illustrate the microscopic 
images after immunohistochemistry and cryostat sectioning showing GFP 
localization in the heart of Fi embryos, specifically in the myocardium of ventricles 
(V). Arrows indicate localization of GFP in the heart. A, atrium; B, bulbus 
arteriosus; V, ventricle. Scale bars represent 100 urn. 

DETAILED DESCRIPTION OF THE INVENTION 

[0027] Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the 
art to which this invention belongs. All publications mentioned herein are 
incorporated by reference to disclose and describe the methods and/or materials 
in connection with the publications cited. In this specification and the appended 
claims, the singular forms "a," "an," and "the" include plural reference unless the 
context clearly dictates otherwise. 
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[0028] The invention provides oligonucleotides, an isolated BMP4 gene and its 
genomic structure, promoter regions, transcriptional regulatory elements, 
expression vectors, and transgenic fish. The invention provides analysis results for 
zebrafish BMP4 gene structure and method to study its developmental expression 
pattern and tissue-specific expression pattern. The invention further provides 
methods to identify regulatory elements (e.g., enhancers, silencer, and the like), 
transcriptional regulators (e.g., repressors, activators and the like), and other 
effectors (e.g. extracellular stimuli, compounds and agents) that contribute to 
BMP4 gene expression in general and for specific organ-specific, tissue-specific, or 
cell type-specific expression. 

I. BMP4 Gene and Isolation of BMP4 Gene 

[0029] We have cloned and identified the zebrafish BMP4 gene. The zebrafish 
BMP4 genomic structure including its 5'-untranslated region (5'UTR), exons 1, 
exons 2, exons 3, exons 4, intron 1, intron 2, intron 3, coding regions, 3'- 
untranslated region (3'UTR), promoter regions, and proximal and distal regulatory 
sequences is shown in Figure 1. As shown in Figure 1, the zebrafish BMP4 gene 
transcription unit contains 4 exons and spans at least approximately 36 kb. There 
are total of 4 exons (exons 1-4) and 3 introns (intron 1-3) for zebrafish BMP4 gene. 
As a comparison, human and mouse BMP4 genes contain 5 exons and span 
approximately 7 kb. The sizes of the introns mainly contribute to the size 
differences of these BMP4 genes. We have found out that the sizes of the three 
zebrafish BMP4 introns, about 6 kb, about 3.5 kb, and about 8.9 kb, respectively, 
are larger than the sizes of four human BMP4 introns (about 1 .9 kb, about 1.1 kb, 
about 1.2 kb, and about 1 kb) and four mouse BMP4 introns (about 2.1 kb, about 
0.8 kb, about 1 kb, and about 1 kb). 

[0030] In Figure 1, two coding exons, exon 3 and exon 4, corresponding to the 
coding regions are shown in gray boxes. Two exons, exon 1 and exon 2, 
corresponding to the 5'-untranslated regions are shown in hatched boxes. Two 
introns, intron 1 and intron 2, are located in the 5'-untranslated region. The 3'- 
untranslated region is shown as a dotted line. Also shown are two restriction 
enzyme sites, Xho I and >Age I, located at the + 1735 and -7263 positions relative 
to the transcription start site (designated as +1), respectively. An arrow also 
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denotes the transcription start site. A proximal regulatory region is shown as open 
box spanning from about +1 to about -7263. A distal regulatory region is also 
identified and shown as mosaic boxes spanning for a length of about 19,528 bp, 
shown as the 19.5 kb distal regulatory region in Figure 1 and located at the 
position from about -7263 to about -26791 . 

[0031] The invention provides SEQ ID No. 4 having a length of about 13,382 
base pairs and including part of intron 2, exon 3, intron 3, and exon 4, shown as 
the 13 kb coding region in Figure 1 . The nucleic acid sequences of SEQ ID No. 4 
described herein have been deposited into GenBank under accession no. 
AF056336. The nucleic acid sequences of exon 3 and exon 4 include the coding 
regions corresponding to two DNA fragments located from position 2,637 to 
position 2,984 and from position 11,948 to position 12,802 of the 13,382 bases 
DNA sequences of SEQ ID No. 4. The deduced amino acid sequences of the two 
coding regions are listed in SEQ ID No. 5 (N-terminal portion) and SEQ ID No. 6 
(C-terminal portion). 

[0032] The two coding regions are connected by intron 3. Nucleotide sequence 
comparison of the identified intron 3 with genes in the sequence databank of 
National Center for Biotechnology Information (NCBI) exhibited a high degree of 
sequence homology with a DANA retroposon element. The DANA retroposon 
element of BMP4 gene is located in the first unusually long intron, intron 3. This 
DANA retroposon element has also be found in various zebrafish genes, such as 
e/F-4E, ependymin, no tail, and major histocompatibility-! I genes (alignment data 
not shown). The homology extends to regions corresponding to four conserved 
DANA boxes and flanking directed repeats of 9-nucleotide sequences 
(GTTTTAATA). Compared to other DANA elements with four conserved boxes, 
the sequence of the DANA retroposon element in BMP4 gene contains boxes #1 
and #4 that are highly conserved among DANA elements. However, boxes #2 
and #3 of the DANA retroposon element in BMP4 gene show lower sequence 
similarity to the sequences of other DANA elements. The conserved boxes #1 
and #4 contain sequences that are similar to the sequences of A box and B box 
internal control regions of tRNA gene promoter. In addition, we have identified a 
pair of 9-nucleotide direct repeats (GTTTTAATA) flanking these four conserved 
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boxes and the results further support that such a DANA-like (Danio retroposon A) 
SINE (short interspersed elements) element is widely distributed in the zebrafish 
genome. Furthermore, the DANA-like SINE element may have been captured at 
a site of DNA breakage as suggested recently by the studies on yeast 
retrotransposon Ty1 element. The question of why such a large intron is present 
in lower vertebrates like zebrafish remains to be answered. 

[0033]^ We have also isolated a 9.0-kb DNA fragment containing a BMP4 
promoter region and adjacent regulatory region, generated by Age I and Xho I 
restriction digestions at the -7263 and at + 1735 positions relative to the 
transcription start site and is shown as the 9kb promoter and 5'-upstream region in 
Figure 1. Thus, the invention provides SEQ ID No. 1 having a length of about 9,100 
base pairs and including part of exon 1, promoter regions, and proximal regulatory 
DNA sequences. The nucleic acid sequences of SEQ ID No. 1 described herein 
have been deposited into GenBank under accession no. AY1 56927. 

[0034] We have determined the transcription start site of BMP4 gene through 
5' Rapid Amplification of cDNA Ends (5' RACE) and found no TATAAT-like core 
promoter element near the transcription initiation site. Thus, zebrafish BMP4 gene 
contains a TATA-less promoter to direct the transcription of zebrafish BMP4 gene 
having a transcript size of about 3,800 bases as revealed by Northern blotting 
analyses. Similarly, mammalian BMP4 genes are also directed by TATA-less 
promoters. Also provided herein is the zebrafish BMP4 mRNA sequence having a 
length of about 1790 bases as shown in SEQ ID No. 2, which can be deduced 
from the DNA sequences of the BMP4 genomic DNA, e.g., SEQ ID No. 4. The 
mRNA sequences of SEQ ID No. 2 described herein have been deposited into 
GenBank under accession no. NIVM31342. 

[0035] The deduced amino acid sequence from the mRNA sequence is shown 
as SEQ ID No. 3, having the complete 400 amino acid sequences of BMP4 
protein. It is observed that the combined sequences of the two amino acid coding 
sequences, SEQ ID No. 5 (N-terminal portion) and SEQ ID No. 6 (C-terminal 
portion), deduced from the genomic DNA sequences of SEQ ID No. 4, are 
identical to the amino acid sequence of SEQ ID No. 3, deduced from the mRNA 
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sequence of SEQ ID No. 2. The deduced amino acid sequence of the coding 
regions of the zebrafish BMP4 gene is also shown in Figure 2. 

[0036] We have found that the zebrafish BMP4 gene encodes a protein of 400 
amino acids, about 8 amino acids shorter in size than mammalian BMP4 proteins. 
Sequence comparison reveals that zebrafish BMP4 protein shares about 73% 
amino acid sequence similarity with human and mouse BMP4 proteins, whereas it 
shares only about 63% amino acid sequence similarity with human and mouse 
BMP2 proteins. Similar to human BMP4 protein, there are 7 conserved cysteine 
residues present in the carboxyl-terminal domain of zebrafish BMP4 protein. 
Dibasic amino acids (RAKR) are present in zebrafish BMP4 protein and are 
located at similar regions in human BMP4 protein. These dibasic amino acids may 
serve as proteolytic cleavage sites for the generation of mature carboxy-terminal 
portions. However, only three potential AMinked glycosylation sites are present in 
the zebrafish BMP4 protein as compared to four glycosylation sites in human 
BMP4 protein. 

[0037] Both 5' and 3' intron-exon splice junctions in the coding regions were 
conserved between zebrafish and mammalian BMP4 genes. However, intron 1 of 
the zebrafish BMP4 gene is much larger than those found in human and mouse 
genes. Intron 1 and intron 2 are located in the 5' untranslated region (5'UTR) with 
sizes of about 6 kb and 3.5 kb, respectively. The invention also provides SEQ ID 
No. 7 having a length of about 3.5 kb and including the DNA sequences of intron 2, 
and SEQ ID No. 8 having a length of about 6kb and including the DNA sequences 
of intron 1 and 122 bp of exon 2. The DNA sequences of exon is included in the 9 
kb DNA fragment containing the BMP4 promoter region and adjacent regulatory 
region. 

[0038] Additional screening of genomic clones has led to the identification of 
other DNA fragments that also contain sequences that may function as BMP4 
promoter regions. For example, DNA sequences spanning about 19.5 kb in the 5' 
upstream regions and being more distal to the 9 kb Age \-Xho I promoter 
region/proximal regulatory region have been isolated. We have analyzed the 
function of this 19.5 kb DNA region and found out that it contains additional 
transcriptional regulatory elements for BMP4 gene expression. Thus, the invention 
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also provides SEQ ID No. 9 having a length of about 19.5 kb that includes distal 
regulatory DNA sequences. 

II. Use of BMP4 Gene 

[0039] The invention provides nucleic acid compositions for BMP4 gene, 
homologs and fragments thereof. Furthermore, the invention also provides coding 
sequences encoding BMP4 polypeptides, homologs and fragments thereof, and 
methods for producing and purifying recombinant BMP4 protein in vitro through 
recombinant DNA technology. In addition, nucleic acid compositions for BMP4 
gene are useful in controlling expression of BMP genes during development and 
identifying chemical compounds, factors, agents, or other substances that affect 
(e.g., stimulate or inhibit) regulation of BMP4 gene expression. 

[0040] The nucleic acid compositions of the invention may encode all or a part 
of the polypeptides for the BMP4 gene. Double- or single-stranded fragments of 
the DNA sequence may be obtained by chemically synthesizing oligonucleotides in 
accordance with conventional methods, by restriction enzyme digestion, by PCR 
amplification, etc. For the most part, DNA fragments will be of at least 15 
nucleotides, usually at least 18 nucleotides or 25 nucleotides, and may be at least 
about 50 nucleotides. Small DNA fragments are useful as primers for PCR, 
hybridization screening probes, etc. Larger DNA fragments, i.e., greater than 100 
nt, are useful for production of a protein or polypeptide. 

[0041] Altered nucleic acid sequences encoding the BMP4 gene may include 
deletions, insertions, or substitutions of different nucleotides resulting in a 
polynucleotide that is the same or a functional equivalent of the BMP4 endogenous 
gene product. Such analysis is useful to study the sequences that are crucial to the 
expression of BMP4 gene and the function of the BMP4 protein. For example, 
altered nucleic acid sequences of the BMP4 gene may be used to generate 
changes in promoter strength or sequences of the encoded proteins. The encoded 
protein may also contain deletions, insertions, or substitutions of amino acid 
residues, which, as an example, produce silent changes and result in functionally 
equivalent BMP4 protein or, as another example, promote a different folding of the 
encoding proteins or decrease substrate fidelity. Deliberate amino acid 
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substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as 
long as the biological activity of the BMP4 gene is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid; 
positively charged amino acids may include lysine and arginine; and amino acids 
with uncharged polar head groups having similar hydrophilicity values may include 
leucine, isoleucine, and valine; glycine and alanine; asparagine and glutamine; 
serine and threonine; phenylalanine and tyrosine. Such alterations to the BMP4 
gene may be made to increase expression, allow for purification, or to add cross- 
linking groups to make the BMP4 protein more reactive and capable of being fused 
to another heterologous gene product. 

A. BMP4 protein Expression 

[0042] In order to obtain the BMP4 protein, cloning of the BMP4 coding 
sequences of the invention into a recombinant expression vector for recombinant 
protein expression may be necessary. A recombinant expression vector may 
contain necessary elements for transcription and/or translation of the inserted 
coding sequences. Recombinant expression vectors and systems known in the art 
may be employed for producing full length or only portions of the BMP4 
polypeptides of the invention. 

[0043] For long-term, high-yield production of recombinant proteins, stable 
expression of the DNA construct of BMP4 protein is preferred. For example, cell 
lines which stably express the BMP4 protein may be transformed using 
recombinant expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the 
same or on a separate vector. Following the introduction of the vector, cells may be 
allowed to grow for 1-2 days in an enriched media before they are switched to 
selective media. The purpose of the selectable marker is to confer resistance to 
selection, and its presence allows growth and recovery of cells, which successfully 
express the introduced sequences. Resistant clones of stably transformed cells 
may be proliferated using tissue culture techniques appropriate to the cell type. As 
another example, a host cell strain may be chosen for its ability to modulate the 
expression of the inserted sequences or to process the expressed proteins or 
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peptides in a desired fashion. Such modifications of the polypeptide include, but 
are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" 
form of the protein may also be used to facilitate correct insertion, folding and/or 
function. Different host cells such as CHO, HeLa, MDCK, HEK293, and WI38, 
which have specific cellular machinery and characteristic mechanisms for such 
post-translational activities, may be chosen to ensure the correct modification and 
processing of the foreign protein. 

[0044] Production of the BMP4 protein may be as insoluble inclusion body 
fusion proteins. For example, expression of BMP4 protein may be toxic to a host 
cell; thus an expression vector for high level-expression of insoluble protein is 
chosen to avoid the expression of soluble active BMP4 protein. Alternatively, 
genomic DNA encoding the mature proteins for BMP4 are produced and isolated 
without signal peptides in order to express the recombinant proteins inside the host 
cells without processing through the secretory pathway of the host cells. 

[0045] In yet another approach, natural, modified, or recombinant nucleic acid 
sequences encoding the BMP4 protein may be ligated to a heterologous sequence 
to encode a fusion protein. For example, it may be useful to encode chimeric 
proteins that can be recognized by commercially available antibodies. A fusion 
protein may also be engineered to contain a cleavage site located between the 
encoding sequences for the BMP4 protein, and the heterologous protein 
sequences, so that the BMP4 protein may be cleaved and purified away from the 
heterologous moiety. 

[0046] With the availability of the protein or fragments in large amounts, the 
recombinant BMP4 protein may be isolated and purified in accordance with 
conventional methods. Again, see Sambrook, J., et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, NY, and Ausubel, F. M. 
et al. (1989) Current Protocols in Molecular Biology, John Wiley & sons, New York, 
N.Y. A lysate may be prepared of the expression host and the lysate purified using 
HPLC, exclusion chromatography, gel electrophoresis, affinity chromatography, or 
other purification techniques. The purified proteins will generally be at least about 
80% pure, preferably at least about 90% pure, and may be up to and including 
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100% pure. Pure is intended to mean free of other proteins, as well as cellular 
debris. 

[0047] For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, humans, and others, may be immunized by injection with the purified 
BMP4 protein or fusion protein obtained or fragments or oligopeptides thereof that 
have immunogenic properties. Depending on the host species, various adjuvants 
may be used to increase immunological response. Such adjuvants include, but are 
not limited to, Freund f s, mineral gels such as aluminum hydroxide, and surface 
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanin, and dinitrophenol. 

[0048] In summary, nucleotide sequences of BMP4 proteins can be engineered 
using methods generally known in the art. As a result, altered coding sequences, 
including but not limited to, alterations, which modify the cloning, processing, 
and/or expression of the gene product, are obtained. 

B. BMP4 Gene Expression 

[0049] The BMP4 gene is isolated and its genomic structure is analyzed using 
methods of the invention. A genomic sequence of interest, such as the isolated 
BMP4 genomic sequence, includes nucleic acid sequences present between the 
initiation codon and the stop codon, containing all of the introns that are normally 
present in a native chromosome. The genomic sequences of the invention include 
the 3' and 5' untranslated regions found in the mature mRNA. The sequences also 
include specific transcriptional and translational regulatory sequences, such as 
promoters, enhancers, etc., including about 1 kb to 10 kb or more of flanking 
genomic DNA at either the 5' or 3' end of the transcribed region. Genomic DNA is 
isolated as a DNA fragment of 100 kb or smaller that is substantially free of flanking 
chromosomal sequence. Sequences required for proper tissue-specific and stage- 
specific expression are also cloned from genomic DNA flanking the coding region 
(either 3' or 5') and internal regulatory sequences, such as in introns. 

[0050] Of particular interest is the zebrafish BMP4 gene. SEQ ID NO. 4 
provides the nucleotide sequences of a genomic DNA clone having a size of about 
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13882 bp and containing the zebrafish DNA sequence encoding the full-length 
zebrafish BMP4 protein. The subject BMP4 nucleic acids may be cDNAs or 
genomic DNAs, as well as fragments thereof. The term "BMP4 gene" shall be 
intended to mean the open reading frame encoding BMP4 proteins and 
polypeptides, exons and introns of such genes, promoter regions, its proximal and 
distal regulatory sequences as well as adjacent 5' and 3' non-coding nucleotide 
sequences involved in the regulation of expression, up to about 25 kb beyond the 
coding region, but possibly further in either direction. The gene is then cloned into 
an appropriate vector, such as a plasmid vector, for extrachromosomal 
maintenance or for integration into a host genome. Methods well known to those 
skilled in the art may be used to construct cloning vectors containing appropriate 
transcriptional and translational control elements and DNA sequences. Exemplary 
techniques are described in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., Ausubel, F. M. et 
al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York, 
N.Y., and Green, E. et al. (1997) Genome Analysis, A Laboratory Manual, Cold 
Spring Harbor Press, Plainview, N.Y. 

[0051] In order to carry out certain aspects of the invention, primers may be 
used to amplify the genomic or cDNA sequences of the BMP4 gene. For example, 
DNA fragments containing all or portions of the BMP4 coding sequences may be 
used as probes for cloning of other BMP4 genes or additional clones having 
adjacent 5' and 3' non-coding regulatory sequences using hybridization screening 
techniques, PCR amplification/cloning techniques, or others. For instance, 
degenerate primers can be used for cloning of the zebrafish BMP4 gene and 
amplifying BMP4 genes. In addition, specific primer pairs and portions or 
fragments of the nucleic acid sequences of the invention can be used to screen 
and isolate additional BMP4 genomic clones. 

[0052] Also provided are isolated DNA sequences encoding promoter and/or 
proximal regulatory regions of BMP4 gene. For example, SEQ ID No. 1 is provided 
herein as promoter and proximal regulatory regions of BMP4 gene, including 
genomic DNA fragments of about 9,100 bp. 
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[0053] As an example, the sequences of three large genomic fragments (SEQ 
ID No. 7, SEQ ID No. 8, and SEQ ID No 9) in the 5' flanking region are also 
identified and may be modified to effect promoter elements and/or enhancer 
binding sites, to provide developmental regulation in various cells, tissues, and 
organs where expression of the BMP4 gene is desired. Thus, the invention 
provides analyses and methods to identify DNA sequences or DNA regions 
required for controlling gene expression in a cell-, tissue-, or organ-specific manner. 
Such cell type specific controlling expression element is useful for determining the 
expression pattern of the gene, and for providing promoters that mimic the native 
expression pattern. Naturally-occurring polymorphisms in the promoter region are 
useful for determining natural variations in expression, particularly those that may 
be associated with diseases. 

[0054] Alternatively, mutations, deletions, insertions, and substitutions may be 
introduced into various promoter regions to alter the expression of the nucleic acid 
sequence. In addition, methods for the identification of specific DNA motifs involved 
in the binding of transcriptional factors are known in the art, e.g., sequence 
similarity to known binding motifs, gel retardation studies, etc. For examples, see 
Blackwell et al. (1995) Mol Med 1: 194-205; Mortlock et al. (1996) Genome Res. 6: 
327-33; and Joulin and Richard-Foy (1995) Eur J. Biochem 232: 620-626. 
Regulatory DNA sequences that show function in regulating gene expression may 
be used to identify cis acting sequences required for transcriptional or translational 
regulation, such as for the expression of the BMP4 gene, especially in different 
tissues or stages of development, and to identify trans acting factors and/or 
effectors, such as activators, repressor, and the like, which regulate or mediate 
gene expression, as described in detail infra. Such transcription or translational 
control regions may be operably linked to a heterologous gene, such as a reporter 
gene, in order to promote expression of wild type or altered BMP4 genes in 
cultured cells, or in embryonic, fetal, or adult tissues, to generate transgenic fish, 
and for gene therapy. 

[0055] Techniques for in vitro mutagenesis of cloned genes are known. 
Examples of protocols for site-specific mutagenesis may be found in Gustin et al. 
(1993) Biotechniques 14:22; Barany (1985) Gene 37:111-23; Colicelli et al. (1985) 
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Mol Gen Genet 199:537; and Prentki et al. (1984) Gene 29:303-13. Methods for 
site specific mutagenesis can be found in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, CSH Press 1989, pp. 15.3-15.108; Weiner et al., Gene 126:35- 
41 (1993); Sayers et al. Biotechniques 13:592-6 (1992); Jones and Winistorfer, 
Biotechniques 12:528-30 (1992); Barton et al., Nucleic Acids Res 18:7349-55 
(1990); Marotti and Tomich, Gene Anal Tech 6:67-70 (1989); and Zhu, Anal 
Biochem 177:120-4(1989). 

II. Regulation of BMP4 Expression 

[0056] The nucleic acid compositions of the invention are useful in delineating 
the regulation of BMP4 expression, such as the expression of zebrafish BMP4 
mRNA and BMP4 protein during embryonic development, in order to attenuate 
BMP4 expression and identify additional factors and compounds involved using 
screening methods of the invention. For example, a method for identifying an 
expression pattern of a zebrafish BMP4 expression sequence includes providing a 
zebrafish BMP4 expression sequence operatively linked to a nucleotide sequence 
for a heterologous expression product. The zebrafish BMP4 expression sequence 
includes, but is not limited to, DNA sequences for bone morphogenetic protein 4 
(BMP4) gene, such as SEQ ID NO. 1, SEQ ID NO. 4, SEQ ID NO. 7, SEQ ID NO. 
8, and SEQ ID NO. 9. The method further includes introducing into a cell the 
zebrafish expression sequence, and monitoring the expression level of the 
heterologous expression product, thus identifying the expression pattern for the 
expression of BMP4 gene. 

A. Expression of zebrafish BMP4 mRNA during embryonic development 

[0057] Expression of zebrafish BMP4 mRNA during zebrafish embryonic 
development may be determined by Northern blot analysis. A mRNA with an 
approximate size of about 3.8 kb is detected at the gastrula stage. The amounts of 
BMP4 mRNA increase to a maximum level at the pharyngula stage and maintain at 
a slightly lower level throughout the month-long larval stage in the presence of an 
equal amount of total RNA loaded for each stage. In contrast, the mammalian 
BMP4 gene has two alternatively spliced mRNAs with much smaller sizes: about 
1 .5 kb and about 1 .7 kb in human, and about 1 .8 kb and about 2.1 kb in mouse. 
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[0058] Because Northern blot analysis is less sensitive than RT-PCR, the tissue 
and organ distribution of zebrafish BMP4 mRNA is also examined by the latter 
technique. The expression of BMP4 mRNA during embryonic development 
determined by reverse transcription polymerase chain reaction (RT-PCR) is similar 
to the mRNA expression pattern from Northern blot analysis, showing BMP4 
mRNA expression from gastrula stage up to 1 -month-old larvae. In addition, 
results from RT-PCR analysis suggest that the BMP4 mRNA is also expressed at 
other developmental stages, e.g., at both the cleavage and blastula stages during 
embryonic development and also at adult stage, as shown by the presence of the 
BMP4 specific RT-PCR product of about 400 base pair. We have further confirmed 
the identity of the amplified RT-PCR DNA through Southern blot analysis using 
BMP4 DIG-labeled DNA probe. By using the a-chain gene as an internal control, 
we have shown that the observed developmental profile of BMP4 mRNA does not 
result from an uneven amount of total RNA used in the RT-PCR reaction. 

[0059] Therefore, the temporal and spatial mRNA expression patterns of 
zebrafish BMP4 are thoroughly analyzed. In zebrafish early development, BMP4 
mRNA is localized in the ventral part of gastrula embryos. During later embryonic 
development, BMP4 expression becomes restricted to certain regions within 
several organ primordia. For example, in pharyngula stage embryos, BMP4 is 
expressed in the olfactory placode, eyes, otic vesicles, heart, pronephric ducts, 
anus, gut, and pectoral and caudal fin buds. 

B. Expression of zebrafish BMP4 protein during embryonic development 

[0060] To examine BMP4 protein expression, as an example, we have first 
overexpressed a BMP4 fusion protein to be used as an antigen for the production 
of antibody. A 9-kD fusion protein is over-expressed from E. coli cells transformed 
with a recombinant plasmid carrying portions of the coding sequences of the 
cloned BMP4 gene. Upon IPTG induction, the 9-kD fusion protein is purified and 
used as an antigen to raise polyclonal antibodies using standard techniques. The 
resulting antisera recognize the 9-kD fusion protein specifically, as represented by 
a strong band after Western Blot analysis. 
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[0061] The specificity of the antisera is further tested against human TGF-/?1 
and BMP2 recombinant proteins. The results demonstrate that the antisera 
recognize with high affinity a human BMP2 recombinant protein of about 17-kD and 
a protein of about 18-kD from cholate-extractable total protein isolated from 11- 
day-old zebrafish hatching larvae. The antisera also recognize a protein of about 
12.5 kD, indicating weak affinity with the human TGF-/?1 recombinant protein. 

[0062] The expression of BMP4 protein is also examined by Western blot 
analysis. Trace amounts of a protein of about 18-kD are detected at the 
pharyngula stage, and increasing level of expression is observed from hatching 
larval stages to adult fish stage during embryonic development. The size of the 
BMP4 protein is consistent with the expected value derived from the number of 
amino acid residues present in the predicted mature protein after cleavage of the 
predicted signal peptide from the preprotein and the presence of one AMinked 
glycosylation site. Several high-molecular-weight immunoreactive bands are also 
observed. They may represent different species of prepropeptides or another 
member of the BMP family (e.g., BMP2), because the fusion protein encompasses 
the most conserved carboxy-terminal domain. 

[0063] Therefore, BMP4 mRNA is abundantly expressed from gastrula stage 
through adult stage, whereas zebrafish BMP4 protein is actively produced from 
pharyngula stage to adult stage. One possible explanation of the discrepancy 
between the expression of the mRNA and the putative mature 18-kD BMP4 protein 
is that the processing efficiency of BMP4 precursors from cleavage stage to 
pharyngula stage is low and thus yields a lower level of mature 18-kD BMP4 
protein during this period. This could be the cause of the observed time lag 
because high-molecular-weight bands are observed from cleavage stage to 
pharynula stage (data not shown). 

[0064] Prominent developmental events from pharyngula to hatching stage 
include the development of jaw, gill, and fin and the establishment of many organ 
rudiments, except for endodermal structures. During the larval period, the 
development of alimentary tracts, gill filament, jawbone, various skeleton, and fin 
rays continues. Thus, the appearance of both BMP4 mRNA and protein during this 
period of zebrafish development implies BMP4 may be required by some of these 
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events. In comparison, mammalian BMP4 has also been shown to play important 
roles in organogenesis. For example, mouse BMP4 mRNA expression was 
suggested to be required for the formation of heart, pituitary gland, limb, 
craniofacial process, and gut. 

[0065] The presence of BMP4 mRNA in adult fish suggests that the function of 
BMP4 is not restricted to organogenesis during embryonic development. The 
expression of BMP4 mRNA was observed in the brain, heart, digestive tracts, 
testes, and jaw. On the other hand, in adult mice, BMP4 mRNA has been detected 
with high levels of expression in spleen and lung, low levels of expression in liver, 
and no expression in brain, heart, and kidney. The level of BMP4 mRNA in these 
organs also increases with age. These results suggest that BMP4 may help to 
maintain the function of various organs and tissues in adult phase. 

IV. Zebrafish BMP4 Promoter Regions 

[0066] The invention provides isolated DNA sequences encoding promoter 
regions and proximal and distal regulatory sequences of BMP4 gene, and a 
method for identifying and evaluating agents, factors, or compounds important for 
BMP4 gene expression. The method is provided for identifying a potential 
extracellular stimulus, inhibitor, agent, compound, substance, regulator, and/or 
transcription factor (e.g., repressors, activators and others) that regulates bone 
morphogenetic protein-4 expression. The method includes introducing into a cell a 
recombinant expression vector containing DNA sequences for zebrafish bone 
morphogenetic protein 4 (BMP4) gene, promoter sequences of the zebrafish BMP4 
gene, proximal and distal regulatory sequences, 5' and 3' non-coding regions, and 
derivatives and fragments thereof. The method further includes contacting the cell 
with a candidate compound, and monitoring the expression level of the 
heterologous expression product to obtain an altered expression level in the 
presence of the candidate compound and identify the candidate compound as the 
potential agent, compound, substance, regulator, and/or transcription factor. 
Examples of the method are also described in Figures 4 and 5 as described in 
section VII and in the Experimental section below. 
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[0067] Further, the activity of the BMP4 promoter and the effect of the proximal 
and distal regulatory sequences are analyzed in zebrafish embryos via transient 
and stable transgenic expression analyses. For example, the results of deletion of 
portions or fragments of the promoter and regulatory sequences provide a basis for 
understanding the mechanism of BMP4 gene expression in general and, 
specifically, BMP 4 gene expression patterns during different developmental stages 
and cell-specific and tissue-specific regulation of BMP4 gene expression. In 
addition, mixing various portions or fragments of the promoter and regulatory 
sequences together through reconstitution, ligation, co-transfection of these 
portions or fragments of the promoter and regulatory sequences is also useful in 
understanding the function of different promoter region and regulatory DNA 
sequences. 

[0068] Significantly, promoter regions and proximal and distal regulatory 
sequences of BMP4 gene are used in a method of identifying potential agents for 
zebrafish tissue-specific expression. First of all, a zebrafish tissue-specific 
expression sequence is identified using methods of the invention, such as deletion 
analyses and reconstitution (e.g., co-transfection, ligation) of important promoter 
and regulatory DNA sequences. Secondly, the method includes introducing into a 
cell the zebrafish tissue-specific expression sequence operatively linked to a 
nucleotide sequence for a heterologous expression product. The method further 
includes contacting the cell with a candidate compound and monitoring the 
expression level of the heterologous expression product in the presence of the 
candidate compound to identify the candidate compound as the potential agent 
regulating tissue-specific expression in zebrafish. 

A. Identification of BMP4 promoter regions 

[0069] We have isolated genomic DNA containing at least one BMP4 promoter 
and its upstream proximal and distal regulatory regions to direct BMP4 gene 
expression in different tissues, cells, and organs. The genomic DNA fragments are 
isolated after screening a zebrafish genomic DNA library. We have obtained 
several genomic clones that contain BMP4 promoter and its upstream proximal and 
distal regulatory regions through screening with BMP4 specific DNA probes having 
a length of about 250 b.p. and about 300 b.p. which are products of PCR 
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amplification using BMP4 specific primers. Positive clones are then examined by 
restriction enzyme mapping, subcloning, and sequencing. 

[0070] The genomic DNA fragments containing putative promoter and the 
upstream proximal and distal regulatory regions can be introduced into various 
expression vectors to direct the expression of a DNA sequences for heterologous 
expression products. The resulting recombinant expression construct includes a 
genomic DNA fragment containing the promoter and/or its upstream proximal and 
distal regulatory regions fused to DNA sequences of a heterologous gene, such as 
a reporter gene for the expression of a reporter protein. For example, genomic 
DNA fragments of zebrafish bone morphogenetic protein 4 (BMP4) gene containing 
its promoter, upstream proximal and distal regulatory regions, 5'-non-coding region, 
introns, and/or enhancer sequences, are identified and provided herein as nucleic 
acid sequences of SEQ. ID NO. 1, SEQ. ID NO. 4, SEQ. ID NO. 7, SEQ. ID NO. 8, 
SEQ. ID NO. 9, and derivatives and fragments thereof. 

[0071] Useful reporter genes are characterized as being easy to transfect into a 
suitable host cell, easy to detect using an established assay protocol, and genes 
whose expression can be tightly regulated. Reporter genes contemplated to have 
utility include, but are not limited to, the luciferase gene, the Green Fluorescent 
Protein (GFP) gene, the chloramphenicol Acetyl Transferase gene (CAT), human 
growth hormone, alkaline phosphatase, p-glucuronidase, and p-galactosidase. 
Additional useful reporter genes are any well characterized genes the expression 
of which is readily assayed, and examples of such reporter genes can be found in, 
for example, F. A. Ausubel et al., Eds., Current Protocols in Molecular Biology, 
John Wiley & Sons, New York, (1989). As will be appreciated by those having 
ordinary skill in the art, the listed reporter genes are only a few of the possible 
reporter genes, and it is only for ease of description that all available reporter 
genes are not listed. 

[0072] The resulting recombinant expression constructs are generally analyzed 
through a series of deletion analyses. In order to define regions responsible for 
constitutive and regulated BMP4 gene expression, the activity of a chimeric GFP 
reporter gene containing different DNA fragments of the 5' flanking regions is 
analyzed in various cell lines, embryos, or cell lines derived from the embryos, for 
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example, microinjection into zebrafish embryos at their 1 -cell or 2-cell stage. 
Other cells or cell lines can also be used. It was found that two large DNA 
fragments of about 9 kb (SEQ. ID NO. 1) and about 28.5 kb (SEQ. ID NO. 1 plus 
SEQ. ID NO. 9) exhibit a high level of constitutive gene expression as judged by 
the constitutive reporter gene expression, e.g., constitutive GFP activity, but in the 
sense orientation only. Endogenous BMP4 expression as represented by the level 
of GFP expression is constitutively expressed in the heart, hatching gland, caudal 
fin, and eye, etc. during zebrafish embryonic development. The recombinant 
expression constructs can also be used to analyze regulated BMP4 expression in 
the presence of a stimulating or inhibiting agent, compound, or transcription factor 
(e.g., activator, repressor, and others). 

[0073] For example, a genomic DNA fragment of about 9kb (SEQ. ID NO. 1 ) is 
cloned into an expression vector, such as an EGFP-ITR vector, as described in 
the Experimental section, to be fused with DNA sequences of a GFP reporter 
gene. Different 5-deleted DNA fragments of BMP4 proximal promoter region and 
upstream regulatory sequences are generated by suitable restriction enzymes and 
exo III deletion. The resulting restriction map of at least seven expression 
constructs containing the heterologous reporter gene generated is shown in 
Figure 3. 

[0074] In Figure 3, a 9.0-kb Age \-Xho I DNA fragment contains a BMP4 
promoter, a proximal regulatory region, and the 5'-noncoding region is shown on 
the top and the nucleic acid positions corresponding to the 5'-end of each deletion 
construct are also shown. The. Seven exemplary deletion constructs fused to a 
reporter gene are also shown. The lengths of the corresponding BMP4 DNA 
fragments of the seven expression constructs are estimated to be about 9kb, 
about 8.3kb, about 7.5kb, about 5.2kb, about 4.8kb, about 4.5, and about 2.7kb 
for Age l-GFP, EcoR l-GFP, Bgl ll-GFP, Hind lll-GFP, Exo III A-GFP, Exo III B- 
GFP, and Exo III C-GFP, respectively, and are shown in parenthesis. Partial exon 
1 containing the 5'-untranslated region is shown by the black box. An arrow also 
denotes the transcription start site, designated as +1. An enhanced green 
fluorescent protein (GFP) is used as an exemplary reporter gene. In addition, an 
expression construct containing CMV promoter/enhancer cloned into the GFP 
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expression vector is used as a control. These expression constructs are linearized 
and microinjected into zebrafish 1 -cell zygotes for transient expression analyses 
and generating stable transgenic fish cell lines. 



Table 1. Transient expression analysis of BMP4 promoter activity in zebrafish 
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[0075] Results of transient transgenic analyses and deletion analyses 
demonstrate the presence of negative and positive cis-acting regulatory DNA 
sequences and tissue-specific regulatory sequences. As shown in Figures 6-7 
and Table 1, GFP expression directed by the 9 kb and 7.5 kb BMP4 promoter 
region in zebrafish embryos is observed mostly in the heart (about 10 % to about 
20 % of the long-pec embryos tested), as well as in the skin and muscle, such as 
the hatching gland (about 0.6 % to about 3.1 % of the long-spec embryos tested) 
and caudal fin (about 4.4 % to about 5.0 % of the long-spec embryos tested). 
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Figures 6A-6F demonstrate microscopic images of zebrafish embryos injected 
with the Bgl ll-GFP construct containing the 7.5-kb BMP4 DNA sequences for 
transient expression analysis of BMP4 promoter activity and exhibit the tissue- 
specific expression pattern directed by this 7.5-kb BMP4 DNA sequences. 

[0076] Figures 6A-6F demonstrate microscopic images for transient expression 
analysis of BMP4 promoter activity in zebrafish embryos injected with the Bgl ll- 
GFP construct containing the 7.5-kb BMP4 promoter and regulatory DNA 
sequences. Figures 6A and 6B are the fluorescence microscopy images (in DIC 
mode) of lateral view of prim-8 and long-pec embryos, respectively. Prim-8 
embryos are from an early developmental stage such as 26 hpf (hour post 
fertilization), whereas long-pec embryos are at 48 hpf stage and have been 
hatched out of corion. Figures 6C and 6D are the fluorescence microscopy 
images using FITC filter, showing GFP expression in the heart of prim-8 and long- 
pec embryos, respectively. The localized heart-specific expression, observed by 
the fluorescence of GFP protein, is also indicated by an arrow. Figures 6E and 6F 
are the microscopic images after immunohistochemistry and cryostat sectioning, 
showing GFP localization in the heart of prim-8 and long-pec embryos, 
respectively. Arrows indicate GFP localization in the heart. Scale bars represent 
100 jam. The results demonstrate that the 7.5-kb BMP4 DNA sequences contain 
a BMP4 promoter and DNA sequences required for tissue-specific expression 
such as heart-specific expression during early developmental stage, such as in 
prim-8 embryos, as shown in Figures 6A, 6C, and 6E and in long-pec embryos, as 
shown in Figures 6B, 6D, and 6F. The results confirm constitutive tissue-specific 
expression sequences in the cloned 9kb BMP4 promoter region (SEQ ID NO. 1). 
The results also suggest BMP4 expression in other cells or tissues is controlled by 
other regulatory regions located more distal to the transcription start site outside of 
this 9kb BMP4 promoter and regulatory region. Alternatively, there may be a 
negative regulatory factor or factors in zebrafish embryos during embryo 
development to inhibit BMP4 expression directed by the 9kb BMP4 promoter 
region in tissues other than heart, hatching gland, or caudal fin. 

[0077] Figures 7A-7B demonstrate the percentage of GFP expression in a 
zebrafish embryos population for transient expression analysis of BMP4 promoter 
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activity. The tissue-specific expression of GFP is analyzed in embryos injected 
with various recombinant GFP constructs containing different lengths of the BMP4 
promoter and upstream regions. Long-pec embryos injected with respective 
constructs, Age l-GFP (9.0), EcoR l-GFP (8.3), Bgl ll-GFP (7.5), Hind lll-GFP 
(5.2), Exo III A-GFP (4.8), Exo III B-GFP (4.5), and Exo III C-GFP (2.7) are 
examined under a fluorescence microscope using an FITC filter. The results of 
various recombinant GFP deletion constructs are shown in Figure 7A for heart- 
specific expression (solid circle) and in Figure 7B for expression in other regions 
(solid square), such as skin and muscle of long-pec embryos. Also shown in gray 
bar in Figures 7A and 7B are GFP tissue-specific expression in long-pec embryos 
injected with a control construct containing CWenhancer/promoter-GFP. 

[0078] In Figure 7, the highest level of GFP expression is observed in the 7.5 kb 
Bgl ll-GFP expression construct, higher than the full-length 9kb Age l-GFP 
construct. The results suggest a negative cis-acting regulatory sequence is located 
in the deleted DNA fragment from the 9kb Age l-GFP construct to the 7.5 kb Bgl ll- 
GFP construct, such as about 1.5 kb of the Age l-Bgl II genomic fragment or about 
0.8 kb of the EcoR l-Bgl II fragment. However, co-injection experiments in Figure 8 
reveal no significant change in heart-specific expression percentage when the 1 .5- 
kb Age \-Bgl II DNA fragment is added and compared to the 2.7-kb Exo III C-GFP 
expression construct. These results suggest that a negative effect is difficult to 
recover and/or the 1.5-kb Age \-Bgl II DNA fragment has to compete with a 
negative trans-acting transcription factor during co-injection analysis. 

[0079] Figure 8 demonstrates co-injection of different DNA fragments into 
zebrafish embryos to direct heart-specific expression in zebrafish. The 
percentages of GFP direct heart-specific expression are compared in long-pec 
embryos injected with different DNA expression constructs. The DNA constructs 
contain either Exo III C-GFP, which has minimal promoter activity, alone, together 
with a 2.4-kb Bgl W-Hind III DNA fragment, or with a 1.5-kb Age \-Bgl II DNA 
fragment. The results confirm a positive regulatory region within the 2.4-kb Bgl II- 
Hind III DNA fragment that controls heart-specific GFP expression. In addition, 
reconstituting gene expression by ligating a promoter containing DNA fragment 
with other cis-acting regulatory DNA sequences or by co-transfecting (or co- 
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injecting) a promoter containing DNA fragment and other cis-acting regulatory 
DNA sequences is also useful to study function and the promoter region and the 
cis-acting regulatory DNA sequences. For example, it was found that co-injecting 
the DNA sequences from intron 1 (SEQ ID No. 8) with either the 7.5kb Bgl II or a 
24 kb Asc l-Xho I (described in detail below) expression constructs into zebrafish 
1 cell zygote further increases heart-specific GFP expression of the injected 
embryos by two-fold. The results suggest intron 1 of BMP4 gene includes an 
enhancer sequence. The existence of transcriptional enhancer sequences 
outside of the promoter, distal and proximal regulatory regions further indicate the 
complexity of the transcriptional regulatory mechanisim in controlling zebrafish 
BMP4 gene expression during development and in different tissues. 

[0080] Referring back to Figure 7A, decreasing level of GFP expression is 
observed from Bgl ll-GFP (7.5) to Exo III C-GFP (2.7) suggests a positive cis- 
acting regulatory sequence is located in the deleted DNA fragment from the 7.5 kb 
EcoR l-GFP construct to the 2.7 kb Exo III C-GFP (2.7), such as about 4.8 kb of 
the Bgl ll-Exo III C genomic DNA fragment or about 2.4 kb of the Bgl II- Hind III 
fragment. In addition, co-injection experiments in Figure 8 further confirm the 
existence of positive regulatory elements in the 2.4-kb Bgl W-Hind III DNA region. 
In Figure 8, addition of the 2.4-kb Bgl W-Hind III DNA fragment increases heart- 
specific GFP expression of the injected embryos by 5.3-fold. 

[0081] As another example, a genomic DNA fragment (Asc l-Xho I) of about 24 
kb containing the promoter region plus the proximal and distal regulatory DNA 
sequences in an expression vector (EGFP-ITR) is also obtained. This expression 
construct is isolated by ligating (reconstituting) a genomic DNA fragment containing 
about 19.5 kb of distal regulatory DNA sequences (SEQ ID No. 9) into the 9 kb 
Agel-Xhol expression construct. Microinjection of this expression construct alone 
into zebrafish 1 cell zygote showed GFP expression in the heart (5-16%) and other 
regions (e.g., notochord, muscle, and skin (10-17%)). However, co-injecting this 
expression construct with a 6 kb intron 1 DNA fragment (SEQ ID No. 8) results in 
GFP expression in the heart (19-20%), hatching gland (19-37%), caudal fin (21- 
38%), and other organs. 
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V. Tissu -Specific Expression and Gene Expr ssion during Embryonic 

Dev lopm nt 

« 

[0082] This invention provides a method for the isolation of transcriptional 
regulatory elements that contribute to the tissue-specific patterns of zebrafish 
genes, such as BMP4 genes. Tissue-specific gene expression includes but is not 
limited to gene expression observed solely or preferably in certain tissues, 
environmental situations and during certain stages of development. The invention 
further provides a method for isolation of transcriptional regulatory elements that 
contribute to tissue-specific gene expression in fish, such as heart-specific BMP4 
expression in zebrafish as directed by the 9kb Age I- Xho I promoter and proximal 
regulatory region. Transcriptional regulatory elements and expression vectors 
containing the transcriptional regulatory elements are disclosed. The transcriptional 
regulatory elements drive tissue-specific gene expression in transgenic zebrafish. 
The transcriptional regulatory elements are utilized to generate expression 
constructs using various expression vectors. The expression vectors contain a 
transcriptional regulatory region that includes a tissue-specific element isolated by 
the methods of this invention operably linked to a heterologous reporter gene that, 
upon expression of the protein product of the reporter gene, confers an assayable 
product for the expression of the transcriptional regulatory elements. 

[0083] For example, we have examined the activity of the bone morphogenetic 
protein 4 (BMP4) promoter in zebrafish embryos via transient and stable 
transgenic expression analyses in order to obtain a better understanding of the 
regulation of BMP4 tissue-specific expression. Stable transgenic lines are 
generally prepared to confirm tissue-specific expression of an identified promoter 
region, such as the heart-specificity of the 9.0kb BMP4 promoter region in 
zebrafish. As an example, stable transgenic zebrafishes containing the 7,5-kb 
BMP4 promoter-GFP transgene are generated from F 0 adult fish of a transgenic 
embryo cell line containing the 7.5kb Bglll-GFP expression construct. The Fi 
progeny from F 0 adult fish containing fluorescent hearts after crossing with wild 
type fish is analyzed during embryonic development of the Fi progeny. The results 
are shown in Figure 9 and Table 2, demonstrating that the 7.5-kb BMP4 promoter 
region and the proximal upstream regulatory region drive heart-specific GFP 



30 



Attorney Docket No.: ACAD/0002 
Express Mail No. EV324942758US 

expression, specifically at the myocardium of the ventricles, which are observed in 
long-pec Fi embryos, protruding-mouth Fi embryos, and 100-hpf (hour post 
fertilization) transgenic Fi embryos. Long pec embryos are at 48 hpf stage, 
protruding mouth embryos are at 72 hpf developmental stage, and 100 hpf 
embryos is at a later developmental stage (typically, embryos older than 72 hpf is 
at their larval stages). As shown in Figure 9, GFP expression is localized in the 
myocardium of developing ventricles of all three types of Fi embryos. In addition, 
trabeculation of the myocardium is readily observed in 100-hpf F 1 embryos. 



Table 2. Inheritance of BMP4 promoter-GFP (Bgl ll-GFP construct) in transgenic 
zebrafish lines 



Transgenic F 0 line 


F 0 sex 


F^ positive fish 


GFP location 






No. 


% 




A 


male 


238 


11.3 


ventricle 


B 


male 


360 


53.1 


ventricle 


C 


male 


195 


4.1 


ventricle 


D 


male 


285 


9.8 


ventricle & atrium 


E 


female 


59 


22.0 


ventricle & atrium 


F 


female 


286 


19.9 


ventricle & atrium 


G 


male 


158 


22.8 


ventricle (weak) 


H 


male 


208 


4.8 


ventricle & atrium 


K 


female 


208 


15.4 


ventricle & atrium 


L 


female 


218 


7.3 


ventricle & atrium 



a No. in Fi positive fish indicates total examined embryos 



[0084] Together, these results indicate that the proximal 7.5-kb BMP4 
promoter and regulatory DNA sequences contain transcriptional regulatory 
elements for heart-specific BMP4 expression, while tissue-specific regulatory 
elements for other endogenous BMP4-expressing tissues may reside in more- 
distal regions. Specifically, our results indicate that the 7.5-kb BMP4 promoter 
and its upstream proximal regulatory region contain both positive and negative 
regulatory elements that control heart-specific GFP expression in zebrafish 
embryos. In addition, this promoter can direct GFP expression in the myocardium 
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of the ventricles of Fi embryos from F 0 fish of a transgenic B line. However, it 
does not contain the complete regulatory region that modulates expression of the 
BMP4 gene in other organs, such as the eye, otic vesicle, hatching gland, 
pronephric duct, anus, pectoral, caudal fin, etc. One such example is the 19.5 kb 
distal regulatory region (SEQ ID No. 9), which contains DNA elements to direct 
BMP4 expression in eye, otic vesicle, hatching gland, anus, and caudal fin. 

VI. Enhancer Sequences 

[0085] The invention also provides enhancer sequences for BMP4 expression. 
For example, DNA sequences from the intron 1 of the zebrafish bone 
morphogenetic protein 4 (BMP4) gene suggest the presence of an enhancer 
sequence, as shown in SEQ ID No. 8. About 6 kb of the intron 1 was sequenced. 
Identification of an enhancer sequence is confirmed by co-injection of this 6 kb 
DNA segment with either 7.5kb Bgl II or 24 kb Asc l-Xho I expression constructs 
into zebrafish 1 cell zygote. GFP expression in the heart is increased by two fold 
due to the presence of this intron 1 -enhancer element. In addition, increased 
gene expression is also demonstrated in other tissues, such as eye, hatching 
gland, caudal fin, etc, when this intron 1 -enhancer element is present. 

VII. Use of Zebrafish BMP4 Gene, Promoter Regions, and Enhancer 
Sequences 

[0086] In general, the identification of positive and negative c/s-acting 
regulatory sequences, enhancer sequences, and tissue-specific regulatory 
elements are used in methods of the invention for screening an extracellular or 
intracellular potential agent, compound, stimulus, inhibitor, regulator, and/or any 
frans-acting factor that regulates BMP4 expression. In one aspect, Figure 4 
depicts a method 400 of identifying a potential agent, compound, stimulus, 
inhibitor, regulator, and/or frans-acting factor for BMP4 expression. At step 410, a 
recombinant expression vector containing DNA sequences for bone 
morphogenetic protein 4 (BMP4) gene, and one or more regulatory DNA 
fragments, such as promoter regions, proximal and distal regulatory regions, 
enhancer sequences, tissue-specific regulatory elements, and derivatives and 
fragments thereof is introduced into a cell. The recombinant expression vector 
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also contains DNA sequences of a heterologous expression product, including, 
but not limited to, the luciferase gene, the Green Fluorescent Protein (GFP) gene, 
the chloramphenicol Acetyl Transferase gene (CAT), human growth hormone, 
alkaline phosphatase, p-glucuronidase, (3-galactosidase, and any of the 
heterologous expression products whose expression can be assayed. DNA 
sequences for zebrafish bone morphogenetic protein 4 (BMP4) gene and 
promoter sequences include, but are not limited to, various introns and exons of 
the BMP4 gene, 5' flanking regions, positive and negative c/s-acting regulatory 
sequences for BMP4 expression, enhancer sequences for BMP4 gene, any of the 
proximal and distal BMP4 regulatory regions, tissue-specific regulatory elements, 
cell-specific regulatory elements, and DNA fragments and derivatives thereof. For 
example, the BMP4 DNA sequences include any of the sequences described 
herein, including, but not limited to, SEQ ID No. 1, SEQ ID No. 2, SEQ ID No. 4, 
SEQ ID No. 7, SEQ ID No. 8, and SEQ ID No. 9, etc. The recombinant 
expression vector containing BMP4 DNA sequences can be introduced into the 
cell through any of the techniques for delivering DNA fragments inside a cell, 
including, but not limited to, microinjection, eletroporation, transfection, 
transformation, and others. The cell recipient includes any of the cell types, cell 
lines, or embryos and may preferably be at various stages during development for 
comparison. 

[0087] At step 420, the cell is contacted with a candidate compound, generally 
through a screening method from a panel of suitable compounds, agents, 
substances, screening library, or a construct containing any of the suitable 
potential agents, compounds, regulators, stimuli, inhibitors, and any frans-acting 
factors. At step 430, the expression level of the heterologous expression product 
is monitored. For example, an altered expression level in the presence of the 
candidate compound indicates the candidate compound as the potential agent, 
compound, stimulus, inhibitor, regulator, and/or frans-acting factor for BMP4 
expression. Such an altered expression level can be an increased or decreased 
level of BMP4 expression. 

[0088] As an example, a method for identifying a potential agent for zebrafish 
tissue-specific expression includes introducing into a cell a tissue-specific 
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expression sequence operatively linked to a nucleotide sequence for a 
heterologous expression product. Suitable tissue-specific expression sequence 
includes, but is not limited to, the 9.0k b proximal BMP4 promoter region, the 19.5 
kb distal regulatory region, various genomic DNA fragments containing BMP4 
gene, and portions or derivatives thereof, for example, SEQ ID No. 1, SEQ ID No. 
4, SEQ ID No. 7, SEQ ID No. 8, and SEQ ID No. 9, etc. 

[0089] In another aspect, Figure 5 depicts a method 500 of screening an 
effecter that regulates bone morphogenetic protein-4 expression. At step 510, a 
transgenic cell having a portion of a bone morphogenetic protein-4 regulatory 
DNA fragment operatively linked to a heterologous expression product is 
constructed. The transgenic fish can be the transient transfected cells and 
embryos, and cells from transient F 0 adults expressing the heterologous 
expression product, and preferably cells from stable transgenic Fi embryos and 
adults expressing the heterologous expression product. The regulatory DNA 
fragment can be any of the 5' flanking regions, enhancer sequences, promoter 
regions, genomic DNA or cDNA of the BMP4 gene, and derivatives thereof, for 
example, SEQ ID No. 1, SEQ ID No. 4, SEQ ID No. 7, SEQ ID No. 8, and SEQ ID 
No. 9, etc. 

[0090] At step 520, a foreign DNA is introduced into the transgenic cell of the 
invention. For example, the foreign DNA can be a DNA fragment screened from a 
cDNA library, genomic DNA library, and any DNA construct containing suitable 
potential agents, compounds, stimuli, inhibitors, regulators, and trans-acting 
factors. At step, 530, the expression level of the heterologous expression product 
for a number of the transgenic cell containing the foreign DNA is monitored. An 
altered expression level in the presence of the foreign DNA as compared to the 
absence of the foreign DNA indicates the foreign DNA encodes the effecter. 

[0091] Recently, several mutations that disrupt cardiac chamber formation 
have been identified in zebrafish. While, our BMP4 promoter transgenic lines 
would allow in vivo imaging of cardiac morphogenesis during various stages of 
heart development, in addition, mutagenesis analyses on these transgenic 
embryos would identity additional genes that may regulate BMP4 function in 
cardiac development. 
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[0092] In addition, a method for identifying an expression pattern of a zebrafish 
BMP4 expression sequence is provided. The method includes providing a 
zebrafish BMP4 expression sequence which is operatively linked to a nucleotide 
sequence of a heterologous expression product, introducing into a cell the 
zebrafish DNA expression sequence, and monitoring the expression level of the 
heterologous expression product. As a result, the expression pattern for the 
expression of BMP4 gene by the expression sequence is identified at various 
stages during embryonic development, during morphogenesis, during 
organogenesis of a specific organ, or during formation of a specific tissue. For 
example, GFP expression directed by the 7.5kb Bgl ll-Xho I DNA fragment is useful 
for in vivo imaging of BMP4 expression pattern during zebrafish heart development 
and cardiac morphogenesis, such as cardiac chamber formation as described in 
Figure 9. 

[0093] The expression pattern of zebrafish BMP4 expression can also be 
analyzed by other methods. For example, we have performed whole-mount in situ 
hybridization to analyze endogenous gene expression at mRNA level using DNA 
probe prepared from the BMP4 DNA of the invention. 

EXPERIMENTAL 

[0094] The following examples are put forth so as to provide those of ordinary 
skill in the art with a complete disclosure and description of how to make and use 
the subject invention, and are not intended to limit the scope of what is regarded 
as the invention. Efforts have been made to ensure accuracy with respect to the 
numbers used (e.g., amounts, temperature, concentrations, etc.) but some 
experimental errors and deviations should be allowed for. Unless otherwise 
indicated, parts are parts by weight, molecular weight is average molecular 
weight, temperature is in degrees centigrade, and pressure is at or near 
atmospheric. 
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EXAMPLE 1: Fish Maint nance 

[0095] Adult zebrafish (Danio rerio) were raised at the zebrafish facility in the 
Institute of Zoology, Academia Sinica. The fishes were maintained in 20 liter 
aquariums supplied with filtered fresh water and aeration under a photoperiod of 
14 hour light and 10 hour dark as described in The Zebrafish Book (Westerfield, 
1995). 

[0096] Cleavage-stage embryos represented a mixture of embryos between 2- 
cell and 16-cell. Blastula-stage embryos represented a mixture of embryos at 
512-cell or higher stage. Gastrula-stage embryos represented a mixture of 
embryos between shield and 75% epiboly. Segmentation-stage embryos 
represented a mixture of embryos between 14-somite and 20-somite. 
Pharyngula-stage embryos represented a mixture of embryos between prim-5 and 
prim-25. 

EXAMPLE 2: Total RNA Isolation 

[0097] Total RNA from embryos at different developmental stages was isolated 
using guanidinium thiocyanate-phenol-chloroform extraction method. Total RNA 
from different adult organs and tissues was extracted using RNAzol B following 
the protocol from manufacturer (Tel-Test, Inc.). For the RT-PCR reaction, total 
RNA was first digested with RNase-free DNase 1 (150 units in a 200-//I reaction 
volume) in 40mM Tris-HC1 pH 7.5, 6mM MgCI2, 10mM NaC1 at 37°C for about 
30 min, followed by phenol-chloroform extraction. DNA and RNA concentrations 
were determined spectrophotometrically (Hitachi U2000). 

EXAMPLE 3: RT-PCR 

[0098] PCR reactions (100 //I) were performed using a mixture containing 10 
/jg of genomic DNA, 100 pmole primers, 5 mM MgCI2, 0.2, mM dNTP, and 2.5 
units of Replitherm DNA polymerase (Eicentre). PCR cycles were set as follows: 
1 min at 94°C, 1 min at 50°C, 1 min at 72°C for 35 cycles, 10 min at 72°C for 1 
cycle and the resulting product was stored at 4°C. DNase-l treated total RNA 
from embryos at different developmental stages (3 jjg) or from different adult 
organs and tissues (0.5 /vg) and components from GeneAmp RNA PCR kit (Perkin 
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Elmer) were used to generate the first-strand cDNA. Twenty microliters of cDNA 
products were then used in a PCR reaction as described above with a pair of two 
BMP4-specific primers. The two BMP4-specific primers are: 5'-TGGTCACA- 
TTCGGACATGACGGCA-3' and 5'-AGA/GTCTCCGTTTACCGGCAGCCA-3\ 
PCR conditions were set as follows: 1 min at 94°C for 1 cycle, 1 min at 94°C, 1 
min at 64°C, 1 min at 72°C for 35 cycles, 10 min at 72°C for 1 cycle, and the 
resulting product was stored at 4°C. The control RT-PCR reaction involved or-actin 
specific primers, which are 5TCACACCTTCTACAACGAGCTGCG-3' and 5- 
GAAGCTGTAGCCTCTCT- CGGTCAG-3' for synthesizing a-actin control 
transcript. 

EXAMPLE 4: Zebrafish Genomic DNA Library Screening 

[0099] To generate a probe for screening a genomic DNA library, a pair of 
degenerate oligonucleotides was designed according to conserved regions 
obtained from amino acid sequence alignment among various mammalian BMP4 
genes. The two degenerate primers are 5'-GAT/C TTT/C T/AC/GI GAT/C GTI 
GGI TGG AA-3' and 5'-CA ICC T/CTC IAC CAT T/CTC T/CTG-3'. Sequence 
analysis of a 270-bp PCR product obtained using the two conserved degenerate 
oligonucleotides as primers revealed that its deducted amino acid (corresponding 
to amino acids 308-396 of the isolated zebrafish BMP4 gene) shared 89% 
sequence identity with the amino acid sequence of human BMP4. The 270-bp 
PCR clone containing the BMP4 carboxyl-terminal domain was also used to 
generate DIG-labeled DNA probe following the manufacturer's protocols 
(Boehringer Mannheim) and to screen for zebrafish BMP4 gene. In order to 
obtain zebrafish BMP4 gene and DNA fragments containing the BMP4 promoter 
and its regulatory regions, a lambda FIX II zebrafish genomic DNA library (kindly 
provided by Dr. C. Y. Chang from the Institute of Zoology, Academia Sinica) was 
screened using several fragments of DIG-labeled DNA located in the 5'- 
untranslated region (UTR) region as probes and according to standard protocols 
in Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Press, Plainview, N.Y. Lambda DNA purification was conducted as 
described in Donovan et al. (1993). Accordingly, the zebrafish genomic DNA 
library was screened using the 270-bp PCR product as a probe and several 
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positive clones were obtained after screening more than 1 .4 x 1 0 7 phage colonies. 
The average size of the isolated DNA fragment was around 14.9 kb. Positive 
genomic clones were isolated and the inserted DNA fragments were digested with 
Not I. These genomic fragments were subcloned into a pBluscript II vector and 
sequences. DNA sequencing was performed using a BigDye terminator cycle 
sequencing ready reaction kit on an ABI Prism 377 automatic DNA sequencer 
(Applied Biosystems). 

[00100] In addition, 5'-RACE (5'- rapid amplification of cDNA ends, Clontech) 
was conducted in order to verify the transcription start site. Typically, poly(A)+ 
mRNA or total RNA isolated from embryos were used as templates. Random 
hexamer or gene specific oligonucleotide (SP1 primer supplied in the kit) was 
used as primers for a reverse transcription (RT) reaction to generate RNA-cDNA 
hybrid product. Terminal transferase was then used to add several dATP to the 
3'end of 1 st -strand cDNA. First PCR amplification was then conducted using gene 
specific oligonucleotide (SP2 primer used) and oligo (dT)-anchor primer as a 
primer pair and the RNA-cDNA hybrid product as template in a first PCR reaction. 
Nested PCR can be performed using PCR product from the first PCR reaction as 
template and gene specific oligonucleotide (SP3) and anchor primer as a primer 
pair in a second nested PCR reaction. The final PCR product from the second 
nested PCR reaction or even from the first PCR reaction can be cloned into 
pGEMT vector (Roche) for sequencing and identification of the transcription start 
site. 

Example 5: Northern Blot Analysis 

[00101] About 25 /jg of total RNA was ethanol precipitated and loaded on 1 .2% 
agarose gels containing 1.2 M formaldehyde and EB buffer. Gels were 
transferred to nylon membranes for 4 hour using a downward alkaline transfer 
method. Membranes were prehybridized for 2 hour in 5X SSC, 50% formamide, 
2% blocking solution, 0.1% sodium lauryl sarcosine, and 0.02% NaDodS04 for 5 
min twice at room temperature followed by 0.1XSSC, 0.1% NaDodS04 wash for 
15 min twice at 65°C. CSPD chemiluminescent detection was conducted 
following protocols from the manufacturer (Boehringer Mannheim), except that 1X 
phosphate-buffered saline (PBS; 137 mMNaCI, 2.7 mM KC1, 10.1 mM 
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Na2HP04, 1.8 mM KH2P04) containing 5% nonfat milk and 0.3% Tween-20 was 
used in place of blocking solution. 

Example 6: Southern Blot Analysis 

[00102] About 10 microliters of RT-PCR products were loaded on 1.5% 0.5X 
TBE agarose gels and denatured, neutralized, and transferred according to 
standard methods (Sambrook et al., 1989). Membranes were hybridized in the 
same pre-hybridization buffer that was used for Northern blot containing 25 ng/ml 
of 270 bp DIG-labeled DNA probes at 42°C overnight. After hybridization, 
membranes were washed with 2X SSC, 0.1% NaDodS04 for 5 min twice at room 
temperature followed by 1X SSC, 0.1% NaDodS04 washed for 15 min twice at 
55°C. AMPPD chemiluminescent detection was carried out following instruction 
from the manufacturer (Boehringer Mannheim) and the same buffer used in the 
Northern procedure. 

Example 7: Generation of Fusion Protein and Production of Polyclonal 
Antibodies 

[00103] A 1.5-kb Xba l-Eco Rl subclone containing most of BMP4 coding region 
and 3' noncoding region was used to generate in wfro-transcribed DIG-labeled 
antisense RNA. To prepare zebrafish BMP4 fusion protein, a 270-bp Bam Hl- 
Hind III restriction fragment containing the BMP4 carboxyl-terminal region was 
inserted into BAM HI and Hind Ill-digested pQE-30 vector. This fragment encoded 
amino acids 308-396 of zebrafish BMP4 protein. Purification of H/s-tagged BMP4 
fusion protein was conducted according to protocols of the manufacturer (Qiagen). 

[00104] Polyclonal antibodies were raised against the purified fusion protein 
according to standard methods (Harlow and Lane, 1988). Crude antisera were 
affinity purified using the Olmstead-style strip purification method. To evaluate the 
specificity of anti-BMP4 antisera, an overnight culture of Escherichia coli cells 
transformed with pQE30 containing the 270-bp BamH l-Hind III fragment of BMP4 
gene was diluted 1:10 in fresh LB medium and induced with IPTG following 
protocols from the manufacturer (Qiagen). Escherichia coli total protein were 
separated by 15% SDS-PAGE and transferred onto nitrocellulose membranes as 
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described in the Western Blot Analysis section, except that 5% nonfat milk in 
PBST (1 X PBS and 0.1%Tween-20) was used as the blocking buffer and a 
colorimetric detection method involving nitro blue tetrazolium (NBT) and 5-bromo- 
4-cholro-3-indolyl phosphate (BCIP) was used for visualizing the purified fusion 
protein from E coli total lysate. 

[00105] The specificity of anti-BMP4 antisera was further tested against human 
recombinant proteins. Both human TGF-/?1 (GIBCO BRL Life Technologies, Inc.) 
and BMP2 recombinant proteins (0.3 jjg) together with cholate extractable total 
protein (19 fjg) isolated from 11-day-old zebrafish hatching larvae were separated 
by 10% NaDodS04-PAGE and Western Blotting was performed. 

Example 8: Western Blot Analysis 

[00106] Embryos from various developmental stages were harvested by 
centrifugation and subjected to homogenization in buffer containing 2% cholic 
acid, 10 mM Tris-HC1 pH 8.0 at 4°C, 10 //l/ml PIC 1 (protease inhibitor cocktail: 1 
mg/ml leupeptin, 2 mg/ml antipain, and 10 mg/ml benzamidine dissolved in 
aprotinin), and 1 mM phenylmethylsulfonyl fluoride (PMSF). Extracts were 
centrifuged and the supernatants used for Western blot analysis. The protein 
concentration was measured by Pierce BCA method. Samples of cholate- 
extractable total protein (25 fjg) were separated by 10% SDS-PAGE and 
electrophoretically transferred to nitrocellulose membranes using a transfer buffer 
containing 25 mM Tris pH 8.8, 192 mM glycine, and 20% methanol. The 
nitrocellulose membrane blots were blocked in a blocking buffer containing 5% 
BSA in PBST at 4°C overnight. 

[00107] The blots were incubated with affinity-purified anti-BMP4 antisera, pre- 
dicted 1 : 10,000, at room temperature for about one hour. After several washes in 
the blocking buffer, the blots were incubated with horseradish peroxidase- 
conjugated goat anti-rabbit IgG secondary antibody (Jackson Immuno Res. Lab., 
Inc.), pre-diluted 1:10,000) at room temperature for about one hour. After several 
washes in the blocking buffer and a 15 min wash in PBST containing 0.5 M NaCI, 
the blots were developed by an enhanced chemiluminescence method following 
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protocols from the manufacturer (ECL method, Amersham) to reveal the presence 
of protein-antibody complexes. 

Example 9: DNA Expression Constructs 

[00108] A 9kb Age \-Xho I genomic BMP4 DNA fragment was cloned into a 
modified EGFP-ITR expression vector containing a GFP gene that was flanked by 
inverted terminal repeats (ITRs) from an adeno-associated virus (AAV, Hsiao et 
al., 2001). The EGFP-ITR vector (Hsiao et al., 2001) was changed again to 
incorporate additional restriction enzyme sites by ligating it with annealed 
complementary oligonucleotides. For example, an Age l-GFP construct was 
generated by ligation of the 9-kb Age \-Xho I DNA fragment into the modified 
EGFP-ITR vector digested with Age I and Sal I. Similarly, EcoR l-GFP, Bgl ll-GFP, 
and Hind II l-GFP expression constructs were subcloned by ligation of a 8.3-kb 
EcoR \-Xho I, a 7.5-kb Bgl \\-Xho I, and a 5.2-kb Hind \W-Xho I DNA fragment into 
the modified EGFP-ITR vector digested with Sal I and respective restriction 
enzymes. Exo III deletion was generated using the Erase-a-Base system 
(Promega). DNA sequencing was conducted to confirm the DNA sequences at the 
5' end of various constructs. After restriction enzyme digestion, exo III nuclease 
deletion, ligation, transformation and confirmation by sequencing, several BMP4 
promoter-GFP recombinant expression constructs with different 5'-end BMP4 
DNA fragments to direct the expression of the GFP gene were generated. These 
deletion expression constructs containing different lengths of BMP4 DNA 
fragments subcloned into the EGFP-ITR expression vector are shown in Figure 3, 
illustrating the respective restriction sites on both ends and their lengths. 

[00109] Figure 3 illustrates total of 7 DNA expression constructs containing the 
heterologous reporter gene obtained. These expression constructs include Age I- 
GFP (9.0), EcoR l-GFP (8.3), Bgl ll-GFP (7.5), Hind lll-GFP (5.2), Exo III A-GFP 
(4.8), Exo IIIB-GFP (4.5), and Exo IIIC-GFP (2.7) with the length of the 
corresponding BMP4 DNA fragments in parenthesis. In addition, a control 
expression construct containing CMV promoter/enhancer cloned into the GFP 
expression vector is also obtained. The resultant gene constructs were 
respectively linearized and microinjected into zebrafish 1-cell zygotes for transient 
expression analyses and generating stable transgenic fish cell lines. 
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Exampl 10: Microinj ction in Zebrafish Embryos 

[00110] Various constructs with different 5' ends were linearized at the Sea I site 
located in the ampicillin-resistant gene of the EGFP-ITR vector. The control 
vector, CMV-EGFP- IT R, was linearized at the Not I site. Linearized DNA was 
purified by phenol-chloroform extraction and ethanol precipitation. The DNA pellet 
was dissolved in water, and approximately 4.6 nanoliters of solution containing 
about 6.2 fmoles of DNA was microinjected into the cytoplasm of zebrafish 
embryos at the 1-cell or 2-cell stage. A trace amount of phenol red was also 
added to aid injection. Microinjection was performed using a Nanoject II automatic 
injector (Drummond). 

Example 11: Evaluation of Various BMP4 Promoter-GFP Constructs via 
Transient Expression Analysis 

[00111] An example of transient expression analysis using the Bgl ll-GFP (7.5) 
expression construct is shown in Figures 6 and 7 and in Table 1. Heart-specific 
GFP expression was observed in the injected prim-8 embryos (Figures 6A and 6C) 
and long-pec embryos (Figures 6B and 6D). In addition, immunohistochemistry 
was performed on prism-5 and long-pec embryos from the transient transgenic 
analysis to further confirm heart-specific GFP expression directed by the Bgl ll- 
GFP (7.5) expression construct using an anti-GFP antibody following cryostat 
sectioning, as shown in Figures 6E and 6F. In general, approximately 10% of the 
injected embryos exhibit GFP expression in the heart. For example, Figure 7A 
demonstrate about 10% of the zebrafish population exhibiting GFP expression in 
the heart when injected with either the 9.0-kb Age l-GFP expression construct or 
the 8.3-kb Eco RI-GFP expression construct. 

[00112] Interestingly, about a two-fold increase in the percentage of the injected 
embryos exhibiting heart-specific GFP expression was observed when injected 
with the 7.5-kb Bgl ll-GFP expression construct as compared to other expression 
constructs. In Figure 7A, the 7.5-kb Bgl ll-GFP expression construct has obtained 
about 20% of heart-specific GFP expression, which is 4.4-fold higher than the 
CMV control expression construct. The CMV promoter/enhancer-GFP control 
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expression construct exhibits about 4.5% of heart-specific GFP expression, which 
represents a random probability of promoter directed GFP expression in the heart. 

[00113] Conversely, a decreased percentage of about 14% of the injected 
embryos exhibiting heart-specific GFP expression was observed when injected 
with the 5.2-kb Hind lll-GFP expression construct as compared to the 7.5-kb Bgl 
ll-GFP expression constructs. This result in Figure 7A suggests that a negative 
and a positive cis-acting regulatory DNA element are located in the 0.8-kb Eco Rl- 
Bgl II and the 2.4-kb Bgl W-Hind III regions. 

[00114] Overall, the results in Figure 7A demonstrated a decreasing percentage 
of the injected embryos exhibiting heart-specific GFP expression. When the 4.8- 
kb Exo III A-GFP, the 4.5-kb Exo III B-GFP, and the 2.7-kb Exo III C-GFP 
expression constructs were respectively injected, about 9.7%, about 6.6%, and 
about 2.4% of the injected embryos exhibit heart-specific GFP expression. The 
results are also summarized in Table 1 . 

[00115] GFP expression in other tissues directed by the 9.0-kb Age l-Xho I 
BMP4 promoter was also examined. As shown in Figure 7B and in Table 1, low 
levels of GFP expression of the injected long-pec embryos were observed in skin 
and muscle. The highest nonspecific GFP expression of about 8.4% in skin or 
muscle was observed when injecting the 2.7-kb Exo III C-GFP expression 
construct. Overall, BMP4 directed GFP expression exhibit a much lower 
expression percentage in the embryo population examined, as compared to the 
CMV promoter/enhancer-GFP control expression construct. The control shows 
approximately 88.1% of the injected embryos express GFP in regions such as 
skin and muscle, as shown as non-specific expression in Table 1 . 

[00116] In addition, low percentages of BMP4 directed GFP expression in the 
hatching gland of the injected long-pec embryos were observed. The results 
range from about 0.9% to about 3.1% of BMP4 directed GFP expression in the 
hatching gland and from about 0% to about 5% of BMP4 directed GFP expression 
in caudal fin for different expression constructs, as shown in Table 1. In general, 
the majority of the injected embryos exhibit no GFP expression, ranging from 
about 75% to about 91% of the embryo population. This is partly due to the 
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mosaicism of transgene distribution and partly due to the presence of the ITR 
sequence in the vector that enhances tissue-specific expression and thus inhibits 
nonspecific expression {Fu et al., 1 998; Ju et a/., 1 999; Hsaio et al., 2001 ). 

Example 12: Co-Injection Transient Expression Analysis of Various BMP4 
Promoter-GFP Constructs 

[00117] To further confirm the presence of a negative and a positive regulatory 
DNA element in the 0.8-kb Eco R\-Bgl II and the 2.4-kb Bgl \\-Hind III DNA 
fragments and clarify heart-specific expression directed by these two regulatory 
elements, co-injection experiments were performed. Two DNA fragments, a 2.4- 
kb Bgl W-Hind III DNA fragment and a 1 .5-kb Age \-Bgl II DNA fragment containing 
the 0.8-kb Eco Rl-Sg/ II region were individually co-injected in equal molar ratio 
with the 2.7-kb Exo III C-GFP expression construct which contains minimal 
promoter activity into zebrafish embryos. The results of the co-injection 
experiments are shown in Figure 8. 

[00118] Figure 8 show that the addition of the 2.4-kb Bgl, W-Hind III DNA 
fragment increase heart-specific GFP expression of the injected embryos by 5.3- 
fold as compared to the Exo IIIC-GFP (2.7) expression construct. For example, 
when the Exo III C-GFP (2.7) construct exhibiting minimal promoter activity is co- 
injected with the 2.4-kb Bgl W-Hind III DNA fragment, about 5.3 fold increase in 
heart-specific GFP expression is observed; in contrast, heart-specific GFP 
expression remains the same for embryos injected with the Exo IIIC-GFP 
construct alone or co-injected with the 1.5-kb Age \-Bgl II DNA fragment. The 
addition of the 1.5-kb Age \-Bgl II DNA fragment showed no significant change in 
heart-specific expression percentage when compared to the Exo IIIC-GFP (2.7) 
expression construct. These results confirm that the 2.4-kb Bgl W-Hind III DNA 
region contains regulatory elements required for heart-specific GFP expression. 

Example 13: Zebrafish BMP4 Promoter and Adjacent Upstream Region 
Contain a Heart-Specific Regulatory DNA Element 

[00119] Results of transient expression analyses in Figure 7 using various 
expression constructs containing different lengths of the BMP4 promoter and 
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proximal regulatory DNA sequences demonstrate that the 0.8-kb Eco Rl-Sg/ II 
DNA fragment contains negative c/s-acting regulatory DNA elements which inhibit 
heart-specific expression, while the 2.4-kb Bgl \\-Hind III DNA fragment contains 
positive c/s-acting regulatory DNA elements which enhance GFP expression in 
the heart. In addition, the co-injection experiment in Figure 8 confirms the 
presence of a positive heart-specific regulatory element in the 2.4-kb Bgl \\-Hind III 
DNA region. However, the fact that we did not observe the inhibitory effect of the 
1.5-kb Age l-8g/ II DNA fragment in the co-transfection experiments may be due 
to the low expression level of Exo IIIC-GFP (2.7) construct containing the minimal 
promoter itself and/or a significant decrease in expression level was hard to 
achieve in this case. Studies have shown that microinjected DNA fragments will 
undergo concatemerization into high molecular weight DNA complexes, which are 
then amplified in injected Xenopus and zebrafish embryos. The resultant high 
molecular weight DNA complexes position the regulatory element in close 
proximity to the promoter that enhances or inhibits expression of the reporter 
gene. 

[00120] Previously, for mouse BMP4 gene, the chicken ovalbumin upstream- 
transcription factor (COUP-TFI) has been shown to inhibit BMP4 promoter activity. 
Transcription factor motif analysis of the zebrafish BMP4 promoter and its 
upstream regulatory regions does not reveal the presence of a COUP-TFI binding 
site in the 1.5-kb Age l-Bg/ II DNA fragment. However, binding sites for 
transcription factors, such as CdxA, SRY, Nkx2.5, and AML-1a, are present in the 
2.4-kb Bgl W-Hind III region. The present invention provides methods for screening 
a potential agent, compound, and/or molecular regulator (e.g., transcription factor, 
activator, repressor, etc) which can interact with BMP4 promoter and requlatory 
DNA sequences and/or direct tissue-specific expression, such as BMP4 gene in 
the heart, eyes, otic vesicles, hatching gland, anus, caudal fin, and other tissues. 

Example 14: Different Zebrafish BMP4 Promoter Regions Directing Tissue- 
Specific BMP4 Gene Expression 

[00121] We have identified the presence of a heart-specific regulatory element 
in the 9 kb BMP4 promoter and 5'-upstream region, as well as other tissue- 
specific elements in a distal zebrafish BMP4 regulatory region. A 19.5 kb 
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zebrafish BMP4 regulatory region located distal to the 9 kb Age l-Xho I 5- 
upstream region was identified and obtained from genomic DNA screening. The 
19.5 kb distal zebrafish BMP4 regulatory region contain DNA fragments required 
to direct BMP4 gene expression in tissues other than heart, such as eyes, otic 
vesicles, hatching gland, anus, and caudal fin, as tested in long-pec embryos. 

[00122] In addition, we have obtained results from whole-mount in situ 
hybridization which demonstrate that BMP4 mRNA is expressed in a variety of 
tissues, such as eyes, otic vesicles, heart, pronephric ducts, hatching gland, anus, 
and pectoral and caudal fin buds at least in prim-5 (24 hpf) embryos. However, the 
9 kb Age I expression construct only directed GFP expression in the heart. Thus, 
the regulation of tissue-specific expression of the BMP4 gene is complex and we 
contemplate using cloned genomic BMP4 DNA fragments (e.g., SEQ ID No. 1, 
SEQ ID No. 4, SEQ ID No. 7, SEQ ID No. 8, SEQ ID No. 9, efc.) to perform 
detailed comparison of more deletion constructs and co-injection analysis, to 
delineate the complex mechanism regulating BMP4 gene expression, and to 
identify other factors, compounds, and DNA sequences involved in BMP4 gene 
expression. 

Example 15: Production and Inheritance of BMP4 promoter-GFP Transgenic 
Zebrafish 

[00123] We raised long-pec embryos, having injected 7.5 kb Bgl ll-GFP 
construct to direct heart-specific GFP expression, to sexual maturity. Crosses 
between F 0 adults and wild type fish of different sex were performed in order to 
obtain stable transgenic fish cell lines. The identification of transgenic progeny 
was conducted by examining GFP expression using fluorescence microscopy 
equipped with a FITC filter (Zeiss, Axioplan 2). 

[00124] The results are summarized in Table 2. Out of 56 F 0 adult fish, 25% 
(14) of them were found to have transmitted the transgene to the Fi generation. 
About 10 stable transgenic fish lines were further examined, designated as F 0 
transgenic fish A-L. The 10 transgenic F 0 lines listed in Table 2 comprise 6 male 
and 4 female adult fishes. The results further confirm that heart-specific GFP 
expression directed by the 7.5 kb Bgl ll-Xho I DNA fragment are inherited from F 0 
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transgenic lines to Fi embryos. As shown in Table 2, transmission rates of the Bgl 
ll-Xho I directed heart-specific GFP expression from Fo transgenic lines to Fi 
embryos range from about 4.1% to about 53% among different transgenic lines. 

Example 16: Immunohistochemistry and Cryostat Sectioning 

[00125] Immunohistochemistry was performed based on Park et ai (2000) with 
some modifications. In general, long-pec Fi embryos, protruding-mouth Fi 
embryos, and 100 hour-post-fertilization (hpf) Fi embryos from the transgenic B 
cell line (F 0 ) were fixed in a buffer containing 4% paraformaldehyde in PBS at 
about 4 °C overnight. After two rinses with about 1 ml of PBST buffer (1X PBS, 
0.1% TritonX-100), embryos were treated with acetone (prechilled at about -20 °C) 
for 7 min and washed with PBST three times. For 100-hpf Fi embryos, additional 
steps were performed in order to make the 100-hpf Fi embryos more permeable. 
For example, treatment with about 100 jug/ml of proteinase K for 30 min and 
longer paraformaldehyde fixation of about 20 min were performed before the -20 
°C acetone treatment. 

[00126] Embryos were then treated with PBS-DT blocking solution (1X PBST, 
1% BSA, 1% DMSO, 0.1% Triton X-100, 2% goat serum) for 1 hour, followed by 
incubating with pre-absorbed anti-GFP antibody (pre-diluted 1: 1000) at about 4 
°C overnight. After PBS-DT washes for approximately 4 hour, embryos were 
incubated in biotin-anti-rabbit IgG (1: 500; Vector) diluted with PBS-DT blocking 
solution at 4 °C overnight. Embryos were washed again with the PBST-DT 
solution for a total of about 6 hour time period and then incubated with ABC 
(Vector) reagent at room temperature for 2 hour, followed by several rinses with 
the PBS-DT solution. Thereafter, embryos were stained in 1 ml of a DAB/nickel 
solution (Vector) prepared according to manufacturer's protocols for 5-10 min and 
monitored under a microscope. The chromogenic reaction was terminated by 
washes with 0.1 M NaP0 4 (pH 7.4), followed by 4% paraformaldehyde fixation at 
about 4 °C overnight. 

[00127] Cryostat sectioning was conducted according to Westerfield (1995). 
Embryos were washed with a fix buffer (4% sucrose, 0.15 mM CaCI 2 , 0.1 M 
NaP0 4 , pH 7.4) about three times for 5 min at room temperature, followed by 
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embedding embryos in agar (1.5% agar, 5% sucrose, 0.1 M NaP0 4 , pH 7.4) into 
agar blocks. Agar blocks containing different embryos were soaked in 30% 
sucrose until they sank. Embryos in agar blocks were transferred to an embedding 
chamber filled with OCT cryostat embedding medium (Tissue Tek), frozen inside a 
cryostat (Leica CM1900). Cryostat sections at about 16|im interval were 
produced according to the manufacturer's procedures. 

Example 17: Zebrafish BMP4 Promoter and the S'-Upstream Region Drive 
GFP Expression in the Myocardium of Ventricles of Transgenic Fi Embryos 

[00128] The results from immunohistochemistry and cryostat sectioning of long- 
pec Fi embryos, protruding-mouth Fi embryos, and 100 hour-post-fertilization (hpf) 
Fi embryos demonstrate variation of GFP expression in different parts of heart 
tissue during embryonic development. As shown in Table 2, Fi progeny from 4 
transgenic lines exhibit GFP expression mainly in the ventricle with , different 
intensities, while those embryos from the other 6 transgenic lines contained GFP 
expression in both the ventricle and atrium. Therefore, Fi embryos from 
approximately 40% of the transgenic F 0 lines direct in vivo GFP expression mainly 
only in the ventricles, while those from other transgenic F 0 lines (about 60%) direct 
in vivo GFP expression in both the atrium and ventricle. The positional effect of 
different integration sites in the chromosome may have contributed to this 
variation. The results support a mechanism of germ cells mosaicism in F 0 founder 
fish. This result is consistent with previous transgenic fish reports. 

[00129] In general, endogenous BMP4 is expressed in bilateral heart primordia 
at the 18-somite stage, and BMP4 endogenous expression in long-pec embryos is 
restricted at the sinus venosus-atrial, atrio-ventricular, and ventriculo-arterial 
junctions. In our Fi embryos from the transgenic B line, GFP expression is readily 
observed as an oval-shaped tube at around 30 hour post fertilization, and strong 
GFP expression is detected in all ventricle regions in long-pec embryos. The 
difference between GFP expression directed by our heart-specific expression 
construct and the endogenous BMP4 expression pattern may have been due to 
lack of DNA elements that controls accurate temporal activation and maintenance 
of GFP expression in the 7.5kb Bgl II construct. 
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[00130] Fi embryos from the transgenic B line were chosen for further analysis 
of GFP expression in the heart during different developmental stages because of 
the abundant positive Fi progeny. The earliest GFP expression was observed in 
prim-15 Fi embryos having a conical to elongated shape. As development 
continued, GFP expression was located in the ventricles in long-pec, protruding- 
mouth, and 100-hpf embryos via both fluorescence and confocal microscopy, as 
shown in Figures 9A-9I. In addition, GFP expression in the heart of embryos could 
be observed as long as about 6 days after hatching. These results further 
confirmed that the 7.5-kb BMP4 promoter and its upstream region participate in 
heart-specific GFP expression. 

[00131] Figures 9J-9L exemplify the results of Fi embryos of the transgenic B 
line from immunocytochemistry using the anti-GFP antibody followed by cryostat 
sectioning. It is clear that GFP expression is localized in the myocardium of the 
ventricles of the three developmental stages of Fi embryos examined, long-pec 
(Figures 9J), protruding-mouth (Figures 9K), and 100-hpf (Figures 9L). In 
summary, GFP expression in embryos of the transgenic B line, localized in the 
myocardium of the ventricle is readily observed from about 48 hour to about 100 
hour post fertilization (hfp). 

[00132] In addition, the formation of trabeculae in the ventricle could be 
observed via staining of GFP in 100-hpf embryos, as shown in Figure 9L. This 
observation is consistent with a previous study showing the formation of 
trabeculae in the ventricle occurred at around 72 hours to 120 hours post 
fertilization. This is consistent with studies for higher vertebrate organisms where 
bone morphogenetic protein signaling, such as BMP4 signaling, have been shown 
to play an essential role in the induction of cardiac myogenesis. For example, 
bone morphogenetic proteins act as paracrine signals to specify cardiac myocyte 
induction in avian explant studies. 

[00133] In 9.0 days p.c. mouse embryos, BMP4 mouse transcripts were 
expressed in the outer myocardial layer of the developing atrioventricular canal. In 
addition, cardiac myocyte-specific deletion of the mouse type IA BMP receptor, 
ALK3, suggest that the ALK3 BMP receptor is specifically required for the 
development of the trabeculae, compact myocardium, interventricular septum, and 
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endocardial cushion. Our GFP localization results coincide with trabeculae 
formation (Figure 9L) suggests that expression of zebrafish BMP4, similar to 
mouse BMP4, acts as an autocrine signal for heart development. However, 
nucleic acid sequence comparison between the promoter and upstream regions of 
mouse and zebrafish BMP4 genes revealed no significant homology. 

Example 18: Establishment of Stable Transgenic Zebrafish Lines Expressing 
GFP in Their Hearts 

[00134] We further crossed adult Fi transgenic fish from several transgenic lines 
with wild type fish. The transmission rate from Fi to F 2 fall into the standard 
Mendelian inheritance ratio of 50%, indicating that the transgene {i.e., different 
BMP4 DNA fragments) in these transgenic lines had been integrated into a single 
chromosome locus (data not shown). 

Example 19: Photography and Computer Analysis 

[00135] Images of embryos from transient expression analysis, from the 
transgenic B line, as well as from cryostat sectioning were taken using an RT 
color digital camera (SPOT) on an Olympus BX60 microscope equipped with DIC 
mode and an FITC filter. Laser confocal microscopic images were obtained using 
a Leica TCS NT laser-scanning microscope equipped with an FITC filter. Nucleic 
acid sequences were analyzed using Lasergene software (DNASTAR). Motif 
( http://motif.genome.ad.ip ) was used to predict possible binding sites for 
transcription factors. 

[00136] While the foregoing is directed to embodiments of the invention, other 
and further embodiments of the invention may be devised without departing from 
the basic scope thereof, and the scope thereof is determined by the claims that 
follow. 
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